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(57) Abstract: A vaccine for the treatment or prophylaxis of C. difficile associated disease comprises a C. difficile gene or a C. 

difficile peptide/polypeptide or a derivative or fragment or mutant or variant thereof which is immunogenic in humans. The gene 
£^ encodes a C. difficile surface layer protein, SlpA or variant or homologue thereof. The peptide/polypeptide is a C. difficile surface 
1^* layer protein, SlpA or variant or homologue thereof. The vaccine may comprise a chimeric nucleic acid sequence. 
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"Clostridium difficile vaccine" 

Introduction 

5 The invention relates to vaccines to provide immunological protection against C. 

difficile infection. 

Background 

10 Clostridium difficile is a common nosocomial pathogen and a major cause of 

morbidity and mortality among hospitalised patients throughout the world [Kelly et 
al., 1994]. Outbreaks of C. difficile have necessitated ward and partial hospital 
closure. With the increasing elderly population and the changing demographics of 
the population, C. difficile is set to become a major problem in the 21st century. The 

15 spectrum of C. difficile diseases range from asymptomatic carriage to mild diarrhoea 

to fulminant pseudomembranous colitis. Host factors rather than bacterial factors 
appear to determine the response to C. difficile [Cheng et al., 1997; McFarland et al., 
1991; Shim et al., 1998]. 

20 Reports indicate that hypogammaglobulinemia in children appears to predispose to 

the development of disease due to C. difficile and that therapy with intravenously 
administered gamma globulin can be associated with the clinical resolution of 
chronic relapsing colitis due to C difficile disease [Leung et al., 1991; Pelmutter et 
al., 1985]. A study by Mulligan et al. [1993] found elevated levels of 

25 immunoglobulins reactive with C. difficile in asymptomatic carriers as opposed to 

symptomatic patients. Recently it has been shown that patients who became 
colonised with C. difficile who had relatively low levels of serum IgG antibody 
against toxin A had a much greater risk of developing C. difficile diarrhoea [Kyne et 
al., 2000]. 

30 

It is clear that any advance in the understanding of C. difficile disease and methods 
of preventing or treating C. difficile diarrhoea (CDD) and other related diseases will 
be of major therapeutic potential. 
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Statements of Invention 

According to the invention there is provided a vaccine for the treatment or 
prophylaxis of C. difficile associated disease, the vaccine comprising a C. difficile 
5 gene or a C. difficile peptide/polypeptide or a derivative or fragment or mutant or 

variant thereof which is immunogenic in humans. 

The invention also provides a vaccine for the treatment or prophylaxis of C. difficile 
associated disease, the vaccine comprising a C difficile gene or C. difficile 
10 peptide/polypeptide or a derivative or fragment or mutant or variant thereof to which 

immunoreactivity is detected in individuals who have recovered from C. difficile 
infection. 

Preferably the gene encodes a C. difficile surface layer protein, SlpA or variant or 
15 homologue thereof. 

Preferably the peptide/polypeptide is a C. difficile surface layer protein, SlpA or 
variant or homologue thereof. 

20 Most preferably the vaccine comprises a chimeric nucleic acid sequence. Preferably 

the chimeric nucleic acid sequence is derived from the 5' end of the gene, encoding 
the mature N-terminal moiety of SlpA from C. difficile. 

In one embodiment of the invention the vaccine comprises a chimeric 
25 peptide/polypeptide. Preferably the amino acid sequence of the chimeric 

peptide/polypeptide is derived from the mature N-terminal moiety of SlpA from C. 
difficile. 

Preferably the vaccine of the invention contains an amino acid sequence SEQ ID 
30 No.l or a derivative or fragment or mutant or variant thereof. 
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Preferably the vaccine contains an amino acid sequence SEQ ID No. 2 or a derivative 
or fragment or mutant or variant thereof. 

In one embodiment of the invention the vaccine contains a nucleotide sequence SEQ 
5 ID No.3 or a derivative or fragment or mutant or variant thereof; a nucleotide 

sequence SEQ ID No.4 or a derivative or fragment or mutant or variant thereof; a 
nucleotide sequence SEQ ID No.5 or a derivative or fragment or mutant or variant 
thereof; a nucleotide sequence SEQ ID No. 6 or a derivative or fragment or mutant or 
variant thereof; a nucleotide sequence SEQ ID No. 7 or a derivative or fragment or 
10 mutant or variant thereof; a nucleotide sequence SEQ ID No.8 or a derivative or 

fragment or mutant or variant thereof; a nucleotide sequence SEQ ID No. 9 or a 
derivative or fragment or mutant or variant thereof or a nucleotide sequence SEQ ID 
No. 10 or a derivative or fragment or mutant or variant thereof. 

15 Preferably the vaccine of the invention is in combination with at least one other C. 

difficile sub-unit. 

The invention provides a vaccine for the treatment or prophylaxis of C. difficile 
associated disease, the vaccine comprising the mature N-terminal moiety of a surface 
20 layer protein, SlpA of C. difficile or variant or homologue thereof which is 

immunogenic in humans. 

Most preferably the N-terminal moiety of SlpA contains an amino acid sequence 
SEQ ID No. 1. 

25 

In one embodiment of the invention the N-terminal moiety of SlpA contains an 
amino acid sequence SEQ ID No. 2. 



The invention also provides a vaccine for the treatment or prophylaxis of C. difficile 
associated disease, the vaccine comprising an immunodominant epitope derived 
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from a C. difficile gene or a C. difficile peptide/polypeptide or a derivative or 
fragment or mutant or variant thereof which is immunogenic in humans. 

Preferably the vaccine of the invention comprises a pharmaceutically acceptable 
5 carrier. Most preferably the vaccine is in combination with a pharmacologically 

suitable adjuvant. Ideally the adjuvant is interleukin 12. Alternativley the adjuvant 
may be a heat shock protein. 

In one embodiment of the invention the vaccine comprises at least one other 
10 pharmaceutical product. 

The pharmaceutical product may be an antibiotic, selected from one or more 
metronidazole, amoxycillin, tetracycline or erythromycin, clarithromycin or 
tinidazole. 



15 



In one embodiment of the invention the pharmaceutical product comprises an acid- 
suppressing agent such as omeprazole or bismuth salts. 



The vaccine of the invention may be in a form for oral administration, intranasal 
20 administration, intravenous administration or intramuscular administration. 

In one embodiment of the invention the vaccine includes a peptide delivery system. 

The invention also provides an immunodominant epitope derived from a C. difficile 
25 gene or a C. difficile peptide/polypeptide or a derivative or fragment or mutant or 

variant thereof. Preferably the C. difficile peptide/polypeptide contains an amino 
acid sequence SEQ ID No.l or SEQ ID No.2 or a derivative or fragment or mutant or 
variant thereof. 

30 In one embodiment of the invention the C difficile peptide/polypeptide contains an 

amino acid sequence SEQ ID No.3 or SEQ ID No.4 or SEQ ID No.5 or SEQ ID 
No.6 or SEQ ID No.7 or SEQ ID No.8 or SEQ ID No. 9 or SEQ ID No. 10 or a 
derivative or fragment or mutant or variant thereof. 
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The invention further provides a chimeric nucleic acid sequence derived from the 5' 
end of the slpA gene encoding the mature N-terminal moiety of SlpA from C. 
difficile which is immunogenic in humans. 

5 The invention also provides a chimeric peptide/polypeptide wherein the amino acid 

sequence of the chimeric peptide/polypeptide is derived from the mature N-terminal 
moiety of SlpA from C. difficile. 

The invention provides a C. difficile peptide comprising SEQ ID No. 1 or SEQ ID 
10 No. 2 or SEQ ID No. 3 or SEQ ID No. 4 or SEQ ID No. 5 or SEQ ID No. 6 or SEQ 

ID No. 7 or SEQ ID No. 8 or SEQ ID No. 9 or SEQ ID No. 10. 

One aspect of the invention provides for the use of a C. difficile gene or a C. difficile 
peptide/polypeptide or a derivative or fragment or mutant or variant thereof which is 
15 immunogenic in humans in the preparation of a medicament for use in a method for 

the treatment or prophylaxis of C. difficile infection or C. difficile associated disease 
in a host. 

Preferably the medicament which is prepared is a vaccine of the invention. 

20 

The invention also provides a method for preparing a vaccine for prophylaxis or 
treatment of C. difficile associated disease, the method comprising; 

obtaining a C. difficile gene or a C. difficile peptide/polypeptide or a 
25 derivative or fragment or mutant or variant thereof which is immunogenic in 

humans; and 

forming a vaccine preparation comprised of said gene or peptide/polypeptide 
or derivative or fragment or mutant or variant, which is suitable for 
30 administration to a host and which when administered raises an immune 

response. 
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Preferably the C. difficile peptide/polypeptide contains an amino acid sequence SEQ 
ID No.l or SEQ ID No.2 or a derivative or fragment or mutant or variant thereof. 

Most preferably the C. difficile gene contains an amino acid sequence SEQ ID No.3 
5 or SEQ ID No.4 or SEQ ID No.5 or SEQ ID No.6 or SEQ ID No. 7 or SEQ ID No.8 

or SEQ ID No.9 or SEQ ID No. 10 or a derivative or fragment or mutant or variant 
thereof. 

The invention further provides a method for prophylaxis or treatment of C. difficile 
10 associated disease, the method comprising; 

obtaining a C. difficile gene or a C. difficile peptide/polypeptide or a 
derivative or fragment or mutant or variant thereof which is immunogenic in 
humans; 

15 

forming a vaccine preparation comprised of said gene or peptide/polypeptide 
or derivative or fragment or mutant or variant, and 

administering the vaccine preparation to a host to raise an immune response. 

20 

One aspect of the invention provides monoclonal or polyclonal antibodies or 
fragments thereof, to a C. difficile peptide/polypeptide or a derivative or fragment or 
mutant or variant thereof which is immunogenic in humans. 

25 Another aspect of the invention provides monoclonal or polyclonal antibodies or 

fragments thereof, to C. difficile peptide/polypeptide or a derivative or fragment or 
mutant or variant thereof to which immunoreactivity is detected in individuals who 
have recovered from C. difficile infection. 



30 



The invention also provides purified antibodies or serum obtained by immunisation 
of an animal with a vaccine of the invention. 
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The invention provides the use of the antibodies or fragments of the invention in the 
preparation of a medicament for treatment or prophylaxis of C. difficile infection or 
C. difficile associated disease. 

5 Preferably the antibodies or serum are used in the preparation of a medicament for 

treatment or prophylaxis of C. difficile infection or C. difficile associated disease. 

Most preferably the antibodies or fragments or serum of the invention are used in 
passive immunotherapy for established C. difficile infection. 

10 

In one embodiment of the invention the antibodies or fragment or serum of the 
invention are used for the eradication of C. difficile associated disease. 

The invention also provides use of interleukin 12 as an adjuvant in C. difficile 
15 vaccine. 

The invention further provides use of humanised antibodies or serum for passive 
vaccination of an individual with C. difficile infection. 

20 

Brief Description of the Drawings 

The invention will be more clearly understood from the following description thereof 
given by way of example only with reference to the accompanying figures, in 
25 which:- 

Fig. 1A is a Western blot showing recognition of antigens from a crude 
extract of C. difficile 171500 (PCR type 1) by serum antibodies from a 
patient infected with this strain. Lane 1: Pre-infection; Lane 2: Early acute; 
30 Lane 3: Late acute; Lane 4: Convalescent; 
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Fig. IB is a Western blot showing recognition of antigens from a crude 
extract of C difficile 170324 (PCR type 12) by serum antibodies from a 
patient infected with this strain. Lane 1: Pre-infection; Lanes 2-5: Acute; 
Lanes 6-7: Convalescent; 

5 

Fig. 2. is a Western blot showing recognition of antigens from two C. difficile 
strains of different type by serum from convalescent patients. 

Lane 1: Strain 170324 (PCR type 12), crude antigen preparation 
Lane 2: Strain 170324, surface layer protein preparation 
10 Lane 3: Strain 171500 (PCR type 1), crude antigen preparation 

Lane 4: Strain 171500, surface layer protein preparation. 
Molecular mass markers (kDa) are shown on the left; and 



Fig. 3 is an SDS-PAGE gel showing crude SLP preparations from selected 
15 strains of C. difficile. The gel contains 12% acrylamide, and has been stained 

for protein with Coomassie Blue. Each lane contains 5 \ig of protein. 
Molecular weight markers are shown on the left. 

Lane 1: 171500 (PCR type 1) 
20 Lane 2: 172450 (PCR type 5) 

Lane 3: 170324 (PCR type 12) 

Lane 4: 171448 (PCR type 12) 

Lane 5: 171862 (PCR type 17) 

Lane 6: 173644 (PCR type 31) 
25 Lane 7: 170444 (PCR type 46) 

Lane 8: 170426 (PCR type 92) 



Detailed Description of the invention 

Two antigenic peptides containing SEQ ID No. 1 and SEQ ID No. 2, associated with 
two common infecting types of C. difficile, were found to be immunogenic in 
humans. The antigenic peptides were found to induce a strong immune response in 
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individuals who recover from C. difficile infection. Individuals who have recovered 
from C. difficile infection are those individuals who have been exposed to C difficile 
or something strongly related and have recovered. This includes individuals where a 
carrier state exists in that the C. difficile infection has not and will not necessarily 
5 become clinically significant. 

These antigenic peptides were found to be products of the slpA gene from C. difficile 
which is the structural gene for the surface layer protein, SlpA The gene or its 
products are therefore ideal candidates for the preparation of vaccines against C. 
10 difficile. 

Surface layer proteins (SLPs), also known as S-layers or crystalline surface layers, 
are associated with a wide range of bacterial species. They form a 2-dimensional 
array, which covers the surface of the cell completely, and grows with the cell 

15 [Sleytr et al., 1993]. The molecular weight can range from 40 000 to 200 000 Da. 

The proteins are typically acidic, contain a large proportion of hydrophobic amino 
acid residues, and have few or no sulphur-containing amino acid residues. 
Glycosylated S-layer proteins occur in some species. The precise function of S- 
layers is not always known, but since they comprise approximately 15% of the cell 

20 protein, it seems likely that they are important for in vivo functioning of the 

organism. In Gram positive organisms, the SLP has been shown to delay or prevent 
the excretion of degradative enzymes from the cell to the outside milieu, and may 
thereby create a space analagous to the periplasmic space of Gram negative bacteria. 
Many pathogenic species possess SLPs, which have been ascribed functions such as 

25 antiphagocytosis {Campylobacter fetus), and inhibition of complement-mediated 

killing (Aeromonas salmonicida) . 

Kawata et al. [1984] described the SLPs of Clostridium difficile. They showed the 
S-layer to be composed of 2 polypeptides, and demonstrated size heterogeneity for 
30 the polypeptides from different strains. Delmee et al. [1986] showed that crude 

extracts from C. difficile strains of different serotype showed different polypeptide 
profiles in SDS-PAGE. Poxton et al. [1999] made similar observations using 
purified SLP preparations. Slide agglutination [Delmee et al., 1990] has identified 21 
different serotypes, apparently distinguished by the heterogeneity of the SLP. 

35 
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Pantosti et al. [1989] isolated C. difficile from a number of patients with antibiotic- 
associated diarrhoea, and prepared SLPs from them.. Cerquetti et al. [2000] 
published N-terminal sequences of SLPs from several strains, indicating wide 
differences between strains.. In 2000 the complete DNA sequence of the C. difficile 
5 genome was published (available at web address 

http://www.sanger.ac.Uk/Proiects/C difficile/) . 

The peptides of the invention were found to be encoded by a single open reading 
frame (ORF) named sip A from C. difficile. The peptides identified in our clinical 
10 study correspond to a lower molecular weight moiety of the sip A gene product. 

Since an immune response is also mounted against a higher molecular weight slpA 
gene product (Fig. 2), this entity may also be included in a vaccine. 

The slpA gene has been sequenced from a number of strains corresponding to 

15 different PCR types. The sequences of strains 171500 (PCR type 1)(NCIMB 41081; 

PHLS R13537), 172450 (PCR type 5)(PHLS R12884), 170324 (PCR type 12) 
(NCIMB 41080; PHLS R12882),, 171448 (PCR type 12) (PHLS R13550), 171862 
(PCR type 17) (PHLS R13702), 173644 (PCR type 31) (PHLS R13711), 170444 
(PCR type 46) (PHLS R12883) and 170426 (PCR type 92) (PHLS R12871) with 

20 translations thereof are given in Appendices 1 to 8. Substantial variation in 

nucleotide and predicted amino acid sequence was found between strains of PCR 
types 1, 5, 12, 17 and 31. The genes from strains of PCR types 46 and 92 are almost 
identical in sequence to those of PCR type 12. When the DNA sequences of genes 
of different strains within a PCR type are compared, the sequences are almost if not 

25 quite identical, indicating that the potential for variation is not infinite. These 

findings are in agreement with serotyping studies [Delmee et al., 1986, 1990], and 
indicate that the production of an effective vaccine based on the slpA product is 
feasible. In this respect, the present invention includes all variant slpA genes and 
their products, individually and combined, fragments of them, and their mutants and 

30 derivatives. 

One aspect of the invention provides the combination of immunodominant eptopes 
from the slpA gene products from various serotypes into a single vaccine. In this 
way a single vaccine may be used to immunise against several different C. difficile 
35 strains. 
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The most common PCR types isolated from infections in the clinical study carried 
out at St. James's Hospital, Dublin, Ireland were PCR types 1 and 12. However, a 
vaccine which elicits an intense antibody response against many infecting types 
would be therapeutically very valuable. Recombinant DNA chimera, or several 
5 chimeras, encoding contiguous immunodominant epitopes may be made for use in 

the vaccine. The recombinant DNA may serve as the active component in a vaccine, 
or may be inserted into an appropriate expression system for the generation of a 
chimeric peptide vaccine in a suitable host. 

10 Chimeras can be generated by PCR amplification of the DNA encoding peptide 

regions of interest, incorporating cleavage sites for restriction endonucleases into the 
primers. The amplified fragments can thus be cleaved to generate compatible ends, 
and spliced together to create chimeras. 

15 The dominant epitopes may be identified by cleavage of the slpA products into 

fragments by agents which cleave at known sites, and by immunoblotting with 
homologous patient serum. Immunodominant peptides may be tested for their 
capacity to stimulate T-cell proliferative responses in vitro, using mouse splenic T- 
cells. 

20 

DNA vaccination involves immunisation with recombinant DNA encoding the 
antigen or epitope of interest, cloned in a vector which promotes high level 
expression in mammalian cells. Typically, the vector is a plasmid vector which 
which also replicates in a procaryotic vector such as Escherichia coli, so that the 
25 DNA can be produced in quantity. Following immunisation, the plasmid enters a 

host cell, where it remains in the nucleus, and directs synthesis of the recombinant 
polypeptide. The polypeptide stimulates the production of neutralising antibodies, as 
well as activating cytotoxic T-cells. 

30 Using a DNA vaccine, it may be necessary to modify the DNA sequence to take 

account of codon usage in humans. The G+C content of mammalian DNA is much 
higher than that of C. difficile. The generation of such synthetic DNA molecules, 
essentially containing numerous silent mutations, is within the scope of the 
invention. 



35 
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A peptide vaccine will ideally be made using recombinant peptides. Similar 
considerations apply as in the generation of a DNA vaccine with regard to expression 
in a different host, such as Escherichia coli, which has a different codon usage 
pattern to C. difficile. Problems of expression may be overcome by the use of a 
5 special host strain which carries additional copies of rare tRNAs (e.g. E. coli BL21- 

CodonPlus™-RIL from Stratagene), or by using de novo synthesis of a DNA 
segment carrying silent mutations which will enable normal expression in E. coli. 
There are many expression systems which are likely to allow high-level expression 
of slpA genes in E. coli. An example is the pB AD/Thio TOPO vector of Invitrogen, 

10 in which expressed genes are under control of the arabinose promoter, which is 

subject to positive and negative control, enabling very tight control of expression. In 
this vector, the recombinant protein is typically fused to a modified thioredoxin 
carrying several histidine residues which enable purification by nickel 
chromatography. The recombinant protein can be cleaved from the thioredoxin 

15 moiety by enterokinase enzyme. 

Affinity chromatography may also be used with fixed antibodies or some other agent 
which strongly binds the peptide of interest to purify the protein from the native 
organism. 

20 

Purified immunogenic peptides may be used in combination with other C. difficile 
sub-units as a combined vaccine against C. difficile. Potential candidates are the 
products of the other sip genes, which share limited homology with the slpA gene 
product and with the N-acetylmuramoyl L-alanine amidase, (CwlB), from Bacillus 
25 subtilis, and which may be involved in remodelling of the peptidoglycan. 

Oother purified proteins of C. difficile to which constitutive antibodies are detected 
in individuals recovering from C difficile infection are also within the scope of the 
present invention 

30 

A deposit of Clostridium difficile strain 171500, PCR type 1, was made at the 
NCIMB on January 29, 2001, and accorded the accession number NCIMB 41081. 

A deposit of Clostridium difficile strain 170324, PCR type 12, was made at the 
35 NCIMB on January 29, 2001, and accorded the accession number NCIMB 41080. 
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Two peptides of the invention were found to contain the following sequences: 
33kDa peptide 

SEQ ID No. 1 : DKTKVETADQGYTVVQSK YK 

5 

31kDa peptide 

SEQ ID No. 2 ATTGTQGYTVVKNDGKKAVK 

The invention will be more clearly understood from the following examples. 

10 

Example 1. Clinical Study 

Examination of sequential antibody responses to C. difficile among elderly patients 
who developed the disease was carried out. The study was based on the hypothesis 
15 that the host immune response influenced the development of Clostridium difficile 

disease. In particular we determined that a particular pattern of immune response to 
C. difficile antigens correlated with the outcome of CDD. 

Materials and Methods 

20 

Patients 

Serum was collected from over 300 patients and of these 30 patients developed 
CDD. The infecting strain (homologous strain) was grown from each patient. 
Strains of C. difficile were typed at the Anaerobe Reference Laboratory, Wales 

25 [O'Neill et aL, 1996]. The most common strains isolated were PCR type 1 (n = 15) 

which is the most common type causing epidemics and PCR type 12 (n = 5) which is 
also a common hospital strain. Pre-infection serum samples were obtained from 
patients. Acute phase sera were then collected from patients who developed C. 
difficile disease. Convalescent sera were collected from patients who recovered. 

30 Protein extracts of patients' infecting C. difficile strain were probed with the patients 

sera using Western blotting. IgG responses to the antigens were examined. 

Western blotting 

Proteins from SDS-PAGE gels were electroblotted (0.8mA/cm2 for 1 h) to PVDF 
35 membrane using a semi-dry blotting apparatus (Atto). Primary antibodies (human 
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serum: 1/50 - 1/10,000 dilution) were detected using a 1/5000 dilution of anti-human 
IgG (horse radish peroxidase-conjugated) in combination with enhanced 
chemiluminesence (ECL). Blots were washed in phosphate buffered saline (pH 7.5) 
containing Tween 20 (0.1% v/v), and incubated in the same solution comprising 
5 dried skim milk (5% w/v) and antibodies at the appropriate concentration. Blots 

were exposed to Kodak X-OMAT film for various periods of time and developed. 

Results 

Overall 5 patients made a full recovery and new antibody responses to previously 
unrecognised antigens were evident in 4 of these patients. Three of these patients 
had C. difficile belonging to PCR type 1 and one patient had C. difficile PCR type 12. 
These patients developed an acute phase antibody response to previously 
unrecognised C. difficile antigens which persisted during convalescence (Figs. 1A 
and IB). These antigens were recognised by antibodies from patients who recovered 
and represent potential candidate vaccine antigens. Fig 1A shows a strong reaction 
of convalescent antibodies was observed with the 33 kDa antigen (Lane 4, arrow). 
Fig IB shows a strong reaction of convalescent antibodies was observed with the 31 
kDa antigen (Lanes 6 and 7, arrow). 

20 These antibody responses have also been found in some controls in the same ward 

who were also on antibiotics but who did not develop CDD. 

Example 2. Further characterisation of protective antigens 
Materials and Methods 

Partial purification and N-terminal sequencing of the 33 kDa and the 31 kDa proteins 

The antigens were partially purified from C. difficile based on their molecular weight 
using preparative continuous-elution SDS-PAGE on a model 491 Prep-Cell (Bio- 
Rad). The appropriate antigens were subsequently identified on Western blots 
probed with serum obtained from individuals who recovered from C difficile 
infection. 

Preparation of surface layer proteins (SLPs) 

SLPs were purified from C. difficile by extracting washed cells with 8 M urea, in 50 
35 mM Tris HC1, pH 8.3 in the presence of a cocktail of protease inhibitors 



10 
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25 
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(Complete®, Boehringer Mannheim), for 1 h at 37°C, followed by centrifugation for 
19 000 x g for 30 min. The SLPs were recovered in the supernatant and dialysed to 
remove the urea [Cerquetti et al., 2000]. 

5 Results 

The immunodominant protein which was associated with a positive outcome from C 
difficile strain 171500 (PCR type 1) was identified and purified using preparative 
SDS-PAGE. The N-terminal region of the protein was sequenced using an Applied 
Biosystems Procise Sequencer, viz DKTKVETADQGYTVVQSKYK (SEQ ID 
10 No. 1) 

The antigen which was associated with a protective antibody response from the C. 
difficile strain 170324 (PCR type 12) was identified and the N-terminal sequence 
obtained, viz ATTGTQGYTWKNDGKKAVK (SEQ ID No. 2). 

15 

These sequences were used to interrogate the C. diffcile genome sequence using the 
TBLASTN programme, which compared our query sequences with those of the 
genome project (available at web address 

http://www.sanger.ac.Uk/Proiects/C difficile/) , translated in all 6 possible reading 

20 frames. A nearly identical stretch of sequence was identified when the sequence 

from strain 1710324 (type 12) was used for interrogation. The same stretch of 
sequence was picked up with the sequence from strain 171500 (type 1) was used, 
although the identity was much less strong. Since the homologous sequence 
belonged to an open reading frame encoding a 719-residue peptide, this result was 

25 somewhat surprising. However, when the N-terminal sequences from the higher 

molecular weight SLP component were later published by Cerquetti et al [2000], it 
became apparent that they were encoded downstream along the same gene, 
subsequently identified as slpA, and the reason for the discrepancy in size between 
the gene and its products became readily apparent. 

30 

The purified SLPs from strains 171500 (PCR type 1) and 170324 (PCR type 12) 
showed strong reactivity with homologous convalescent serum, and co-migrated 
with the dominant antigens detected in crude cell extracts as shown in Fig. 2. Lanes 
1 and 3 contain crude antigen preparations from PCR types 1 and 12 respectively, 
35 and Lanes 2 and 4 contain SLP preparations from PCR types 1 and 12, respectively. 
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Panel A was probed with serum from a patient recovering from infection with PCR 
type 1, and Panel B was probed with serum from a patient recovering from infection 
with PCR type 12. Each serum detected 2 major antigens in the infecting strain 
(Panel A, Lane 3); (Panel B, Lane 1), which co-migrated with the 2 SLPs (Panel A, 
5 Lane 4; Panel B, Lane 2), with which the sera also reacted strongly. Note that serum 

from the patient infected with the PCR type 1 strain recognised the higher molecular 
weight SLP from the PCR type 12 strain (Panel A, Lanes 1 and 2), whereas the 
converse did not occur (Panel B, Lanes 3 and 4). There is no apparent antigenic 
cross-reactivity with regard to the lower molecular weight SLPs. 

10 

SLPs were prepared from selected strains by urea extraction, and subjected to SDS- 
PAGE and staining with Coomassie Blue (Fig. 3). Most strains showed a 
characteristic profile, with two major bands located in the 29 000 to 36 000 and 45 
000 to 50 000 molecular weight range. An exception was strain 172450 (Fig. 3, 
15 Lane 2), which showed a single, high molecular weight band, approximately 43 000 

in size. 

Cloning, sequencing and analysis of slvA genes 

The nucleotide sequences of the slpA genes from the two sample strains of C. 

20 difficile (PCR types 1 and 12, deposited at the NCIMB) and of several others (PCR 

types 5, 12, 17, 31, 46 and 92, available from the Anaerobe Reference Unit at the 
Department of Medical Microbiology and Public Health Laboratory, Cardiff, Wales 
were obtained. The slpA gene and flanking sequence was amplified by polymerase 
chain reaction from genomic DNA prepared from C. difficile using a commercial kit 

25 (Puregene® DNA isolation kit for yeast and Gram positive bacteria, Gentra systems 

Minneapolis, MN). The forward primer (5' ATGGATTATTATAGAGATGTGAG 
3'), was based on sequence from the genome sequencing project, starting 112 
nucleotides upstream from the start of the slpA open reading frame. Two reverse 
primers were used, depending on the PCR type. A downstream primer (5' 

30 CTATTTAAAGTTTTATTAAAACTTATATTAC 3') was used to amplify slpA 

from PCR types 12, 17, 31, 46 and 92. A reverse primer based on the 3' end of the 
slpA open reading frame from strain 630 and the subsequent nonsense codon (5' 
TTAC ATATCTAATA AATCTTTC ATTTTGTTTATAACTG 3 ') was used to 



WO 02/062379 



17 



PCT/IE02/00017 



amplify sip A from PCR types 1 and 5. The choice of primer for the latter two PCR 
types may have resulted in a small number of systematic errors in the nucleotide 
sequence obtained. PCR was carried out using HotStar™ Taq polymerase (Qiagen 
Ltd., Crawley, West Sussex, UK) according to the manufacturer's instructions. A 
5 single fragment of approximately 2 kb was obtained for each strain, which was then 

cloned into the pBAD/Thio TOPO vector (Invitrogen, Groningen, Netherlands). 
Inserts were sequenced from both ends by standard procedures in commercial 
facilities at MWG (Wolverton Mill South, Milton Keynes, UK) and Cambridge 
University. New primers were designed on the basis of initial sequencing results, 
10 enabling sequencing of both strands to be completed (a process known as 

chromosome walking). 

The results are shown in Appendices 1-8. 

15 The nucleotide sequences were translated to enable prediction of the amino acid 

sequence(s) of the product (s) (Appendices 1-8). The N-terminal sequences obtained 
experimentally for the low molecular weight protective antigens from strains 171500 
(PCR type 1) and 170324 (PCR type 12) were almost identical to those predicted 
from the nucleotide sequences of their respective sip A genes (18/20 identical 

20 residues for strain 171500, and 19/20 identical residues for strain 170324). 

Appendix 1 shows the open reading frame with translation for slpA from strain 
171500 (PCR type 1), SEQ ID No 3. Since the reverse primer was based on the 35 
nucleotides from the 3' end of the slpA gene, the sequence is not necessarily 100% 
25 accurate in this region. However, this part of the gene does not seem to vary greatly 

from strain to strain. 

Appendix 2 shows the open reading frame with translation for slpA from strain 
172450 (PCR type 5), SEQ ID No 4. Again, the sequence obtained for the 3' 35 
30 nucleotides is not fully reliable. This gene is considerably smaller than the other 

slpA genes sequenced, and shows strong sequence divergence from the other PCR 
types examined. 

Appendix 3 shows the open reading frame with translation for slpA from strain 
35 170324 (PCR type 12) , SEQ ID No 5. This gene showed a single base difference 
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when compared with the strain used for the genome sequencing project, strain 630, 
of the same PCR type. The deduced amino acid sequence is identical. 

Appendix 4 shows the open reading frame with translation for slpA from strain 
5 171448 (PCR type 12), SEQ ID No 6. This gene was almost identical in sequence to 

that from strain 170324. 

Appendix 5 shows the open reading frame with translation for slpA from strain 
171862 (PCR type 17), SEQ ID No 7. 

10 

Appendix 6 shows the open reading frame with translation for slpA from strain 
173644 (PCR type 31), SEQ ID No 8. Like the slpA from strain 172450, this 
sequence is very dissimilar to those of slpA genes from other PCR types 
encountered. 

15 

Appendix 7 shows the open reading frame with translation for slpA from strain 
170444 (PCR type 46), SEQ ID No 9. This sequence is virtually identical to that 
obtained for slpA from PCR type 12 and 92 strains. 

20 Appendix 8 shows the open reading frame with translation for slpA from strain 

170426 (PCR type 92), SEQ ID No 10. This sequence is virtually identical to that 
obtained for slpA from PCR type 12 and 46. 

The cleavage site of the putative signal sequences from both genes was determined 
25 from experimental evidence (the N-terminal sequence of the mature proteins as 

determined by Edman degradation), and by the prediction tool of the Centre for 
Biological Sequence Analysis at the Technical University of Denmark [Nielsen et 
al., 1997]. The site for cleavage of the slpA gene product to form the mature SLPs 
was predicted from experimental [Cerquetti et al., 2000, Karjalainen et a!., 2001 and 
30 Calabi et al., 2001]. The cleavage site is typically preceded by the motif TKS. 

However, the relevant motif is likely to be TKG in strain 173644 (PCR type 31). No 
obvious motif appeared for strain 172450 (PCR type 5). However, the protein 
produced by type 5 strains does appear to be cleaved; hence we predicted the site to 
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occur at a point where the SLP sequence aligns with the cleavage sites of other PCR 
types. 

The molecular weight and isoelectric point was calculated for each of the predicted 
5 mature proteins by the ExPASy server of the Swiss Institute for Bioinformatics 

(Table 1). In general, the calculated molecular weights were in fair agreement with 
apparent molecular masses determined from migration in gels (Fig. 3). No lower 
molecular weight band was apparent for Strain 172450 (PCR type 5; Lane 2). 
However, a higher molecular weight band is present, which is similar in size to the 
10 predicted weight for the C-terminal moiety. We observed a similar profile for 

another type 5 strain. It is possible that the lower molecular weight species is subject 
to degradation in this strain. Another possibility is that it is heavily glycosylated, 
which can affect staining. All peptides had a predicted isoelectric point below 7, 
typical of acidic proteins, and characteristic of SLPs in general [Sleyter et al, 1993]. 

15 

Table 1 



C. difficile strain (PCR type) 


Pi 

(N-terminal) 


Pi 

(C-terminal) 


MW 
(N-terminal) 


MW 
(C-terminal) 


171500 (Type 1) 


4.83 


4.66 


33365.41 


44220.37 


172450 (Type 5) 


4.86 


4.65 


19364.46 


42757.63 


170324 (Type 12) 


4.92 


4.58 


34228.25 


39522.24 


171448 (Type 12) 


4.98 


4.58 


34156.18 


39492.21 


171862 (Type 17) 


5.09 


4.53 


33783.73 


39407.11 


173644 (Type 31) 


5.05 


4.56 


33626.48 


41821.69 


170444 (Type 46) 


5.06 


4.58 


34230.31 


39522.24 


170426 (Type 92) 


4.99 


4.58 


34242.32 


39522.24 



The translated nucleotide sequences were compared with published SlpA sequences 
(EMBL Accession numbers AJ300676, and AJ300677 for examples from PCR types 
1, and 17 respectively; strain 630 available from the Sanger Institute for PCR type 
12; EMBL Accession number AY004256 for a variant from an unnamed PCR type). 
The Clustal W alignment programme, which is freely available, was used. Where 
SlpA sequences from our isolates were compared with those of other strains of the 
same PCR types, they were found to be nearly or quite identical. This observation 
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indicates, together with existing knowledge from serotyping, that the number of 
variants of slpA is not infinite, and that natural evolution of the gene is not rapid. 
Table 2 shows a compilation of homologies, based on amino acid residue identity, 
for the different translated sequences measured against published sequences. 
5 Homologies are compiled for the predicted mature peptides, either combined (Table 

2A) or as N-terminal (low molecular weight, less conserved moiety) (Table 2B) and 
C-terminal (high molecular weight, more conserved) (Table 2C) mature peptides 
according to predicted cleavage sites. It is clear that the SlpA sequences from strains 
172450 (PCR type 5) and 173644 (PCR type 31) are quite distinct particularly with 
10 respect to N-terminal region. 

Table 2A 



Strain.type 


630 
(type 12) 


AJ300676 
(type 1) 


AJ300677 
(type 17) 


AY004256 
(type unknown) 


171500.typel 


55.2 


99.7 


55.4 


56.42 


172450.type5 


49.8 


54.0 


49.9 


47.77 


170324.typel2 


100.0 


57.8 


81.7 


59.77 


171448.typel2 


99.7 








171862.typel7 


82.3 


58.7 


100 


57.54 


173644.type31 


57.9 


59.2 


60.1 


56.88 


170444.type46 


99.6 








170426.type92 


99.9 









Table 2B 



Strain.type 


630 
(type 12) 


AJ300676 
(type 1) 


AJ300677 
(type 17) 


AY004256 
(type unknown) 


171500.typel 


35.4 


100 


34.5 


33.54 


172450.type5 


31.6 


32.2 


31.0 


24.58 


170324.typel2 


100 


34.9 


64.6 


36.14 


171448.typel2 


99.7 








171862.typel7 


64.3 


34.4 


100 


31.55 


173644.type31 


37.5 


34.1 


41.3 


31.86 


170444.type46 


99.1 








170426.type92 


99.7 
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Table 2C 



otrdiri.Ly pc 


630 
(type 12) 


AJ300676 
(type 1) 


AJ300677 
(type 17) 


AY004256 
(type unknown) 


1715O0.typel 


70.2 


99.5 


71.2 


73.80 


172450.type5 


58.4 


60.4 


63.0 


57.60 


170324.typel2 


100 


77.3 


97.1 


80.00 


171448.typel2 


99.7 








171862.typel7 


97.3 


78.8 


100 


79.62 


173644.type31 


74.1 


78.9 


75.1 


75.38 


170444.type46 


100 








170426.type92 


100 









The term antibody used throughout the specification includes but is not limited to 
polyclonal, monoclonal, chimeric, single chain, Fab fragments and fragments 
produced by a Fab expression library. 

The antibodies and fragments thereof may be humanised antibodies. Neutralising 
antibodies such as those which inhibit biological activity of the substance amino acid 
sequence are especially preferred for diagnostics and therapeutics. 

Antibodies both polyclonal and monoclonal which are directed against epitopes 
obtainable from a polypeptide or peptide of the present invention are particularly 
useful in diagnosis and those which are neutralising are useful in passive 
immunotherapy. 

Antibodies may be produced by any of the standard techniques well known in the art. 

A therapeutically effective amount of the polypeptide, polynucleotide, peptide or 
antibody of the inventiion in the form of pharmaceutical composition may be 
adminsistered. The composition may optionally comprise a pharmaceutically 
acceptable carrier, diluent or excipients and including combinations thereof. The 
pharmaceutical composition may be used in conjugation with one or more additional 
pharmaceutically active compounds and/or adjuvants. 
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Different adjuvants depending on the host may be used to increase immunological 
response. The adjuvant may be selected from the group comprising Freunds, mineral 
gels such as aluminium hydroxide and surface active substances. 

5 The vaccine of the invention may be in the form of an immune modulating 

composition or pharmaceutical composition and may be administered by a number of 
different routes such as by injection (which includes parenteral, subcutaneous and 
intramuscular injection) intranasal, intramuscular, mucosal, oral, intra-vaginal, 
urethral or ocular administration. There may be different formulation/composition 
10 requirements dependent on the different delivery systems. 



The invention is not limited to the embodiments hereinbefore described which may 
15 be varied in detail. 
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Appendix 1 

SEQ ID No. 3. Nucleotide sequence of slpA from Clostridium 
difficile strain 171500, PCR type 1, with translation. The putative 
5 secretory signal cleavage site (A) and site of cleavage to form the 

two mature SLPs (♦) are indicated. 

1 ATGAATAAGAAAAATATAGCAATAGCTATGTCAGGTTTAACAGTTTTAGCTTCGGCTGCA 60 
+ + + + + 

10 1MNKKNIAIAMSGLTVLASAA 
20 

61 

CC T G TAT T T GC AG AT G AT AC AAAAG T T G AAAC T G G T GAT CAAGG AT AT AC AG T G G T AC AA 120 

15 + + + + + — 

+ 

21 PVFADDTKVETGDQGYTVVQ 

40 

A 

20 121 

AGC AAG T AT AAG AAAGC T G T T G AAC AAT T AC AAAAAGG AAT AT TAG AT G G AAG T AT AAC A 180 

+ + + + + 

41 SKYKKAVEQLQKGI LDGS I T 

60 

25 181 

GAAATTAAAGTTTTCTTTGAGGGAACTTTAGCATCTACTATAAAAGTAGGTTCTGAGCTT 24 0 

+ + + + + 

61 E IKVFFEGTLASTI KVGSEL 

80 

30 241 

AAT GC AG C AG AT G C AAG T AAAT TAT T G T T T ACAC AAG TAGAT AAT AAAC T AGAT AAT T T A ' 300 

+ + + + + 

81NAADASKLLFTQVDNKLDNL 

100 

35 301 

G G T GAT G GAG AT T AT G T AGAT T T C T T AAT AAC T T C T C C AG G T C AAG G G G AT AAAAT AAC T 360 

+ + + + + 

101 GDGDYVDFLITSPGQGDKIT 

120 

40 361 

ACAAGTAAACTTGTTGCATTGAAAGATTTAACAGGTGCTTCAGCAGATGCTATAATTGCT 4 20 

+ + + + + 

121 TSKLVALKDLTGASADAI IA 

140 

45 421 

GGAACATCTTCAGCAGATGGTGTTGTTACAAATACTGGAGCTGCTAGTGGTTCTACTGAG 4 80 

+ + + + + 

141 GTS SADGVVTNTGAASGSTE 

160 
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481 

ACAAATTCAGCAGGAACAAAACTTGCAATGTCAGCTATTTTTGACACAGCATATACAGAT 54 0 

+ + + + + 

161 TNSAGTKLAMSAI FDTAYT D 

5 180 

541 

T CAT C T G AAAC T GC G G T T AAG AT T AC T AT AAAAG CAG AT AT G AAT GAT AC T AAAT T T G GT 600 
181 S S ETAVKIT IKADMNDTKFG 

10 200 

601 

AAAGC AGG T G AG AC AAC T TAT T C AAC T G GG C T T AC AT T T G AAG AT G G G T C T AC AG AAAAA 660 

+ + + + + 

201 KAGETTYSTGLTFEDGSTEK 

15 220 

661 

AT T GT T AAAT T AGG G G AC AG T GAT AT TAT AG AT AT AAC T AAAG C T C T T AAAC T T AC T GT T 720 

+ + + + + 

221 IVKLGDSDI IDITKALKLTV 

20 240 

721 

GTTCCTGGAAGTAAAGCAACTGTTAAGTTTGCTGAAAAAACACCAAGTGCCAGTGTTCAA 7 8 0 
241 VPGSKATVKFAEKT PSASVQ 

25 260 

781 

C CAG T AAT AAC AAAG C T TAG AAT AAT AAAT G C T AAAG AAG AAAC AAT AG AT AT T G ACGC T 8 4 0 
261 PVITKLRIINAKEETIDI DA 

30 280 

841 

AGT T C T AGT AAAAC AG C AC AAG AT T TAG C T AAAAAAT AT G T AT T T AAT AAAAC T GAT T T A 900 

+ + + + + 

281 SSSKTAQDLAKKYVFNKTDL 

35 300 

901 

AATACTCTTTATAAAGTATTAAATGGAGATGAAGCAGATACTAATGGATTAATAGAAGAA 960 

+ + + + + 

301 NTLYKVLNGDEADTNGLI EE 

40 320 

961 

GTTAGTGGAAAATATCAAGTAGTTCTTTATCCAGAAGGAAAAAGAGTTACAACTAAGAGT 102 0 
+ + + + + 

321 VSGKYQVVLYPEGKRVTTKS 

45 340 

1021 

GCTGCAAAGGCTTCAATTGCTGATGAAAATTCACCAGTTAAATTAACTCTTAAGTCAGAT 1080 
+ + + + + 

341 AAKAS IADENS PVKLTLKS D 

50 360 

♦ 

1081 

AAGAAG AAAG AC T T AAAAG AT TAT G T G GAT GAT T T AAG AACAT AT AAT AAT GG AT AT T C A 114 0 

+ + + + + 

55 361 KKKDLKDYVDDLRTYNNGYS 

380 



WO 02/062379 



27 



PCT/IE02/00017 



1141 

AAT G C TAT AG AAG T AG C AG GAG AAG AT AG AAT AG AAAC T G C AAT AG CAT T AAG T C AAAAA 1200 
+ + + + + 

381 NAIEVAGEDRI ETAIALSQK 

5 400 

1201 

TAT T AT AAC T C T GAT GAT G AAAAT G C TAT AT T TAG AG AT T C AG T T GAT AAT G TAG TAT T G 12 60 
+ + + + + 

401 YYNSDDENAI FRDSVDNVVL 

10 420 

1261 

GTTGGAGGAAATGCAATAGTTGATGGACTTGTAGCTTCTCCTTTAGCTTCTGAAAAGAAA 1320 
+ + + + + 

421 VGGNAIVDGLVASPLASEKK 

15 440 

1321 

GC T C C T T TAT TAT T AAC T T CAAAAG AT AAAT TAG AT T CAAG CG T AAAAGC T G AAAT AAAG 1380 
+ + + + + 

441 APLLLTSKDKLDSSVKAEI K 

20 4 60 

1381 

AG AG T TAT G AAT AT AAAG AG T AC AAC AG G TAT AAAT AC T T C AAAG AAAG T T TAT T TAG C T 14 40 
+ + + + + 

461 RVMNIKSTTGINTSKKVYLA 

25 480 

1441 

GGTGGAGTTAATTCTATATCTAAAGAAGTAGAAAATGAATTAAAAGATATGGGACTTAAA 1500 
+ + + + + 

481 GGVNSISKEVENELKDMGLK 

30 500 

1501 

G T T AC AAG AT T AGC AGG AG AT GAT AGAT AT GAAAC T T C T C T AAAAAT AG C T GAT G AAG T A 15 60 
+ + + + + 

501 VTRLAGDDRYETSLKIADEV 

35 520 

1561 

GGTCTTGATAATGATAAAGCATTTGTAGTTGGAGGAACAGGATTAGCAGATGCCATGAGT 1620 

+ + + + + 

521 GLDNDKAFVVGGTGLADAMS 

40 540 

1621 

ATAGCTCCAGTTGCATCTCAATTAAGAAATGCTAATGGTAAAATGGATTTAGCTGATGGT 168 0 
+ + + + + 

541 IAPVASQLRNANGKMDLADG 

45 560 
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1681 

GATGCTACACCAATAGTAGTTGTAGATGGAAAAGCTAAAACTATAAATGATGATGTAAAA 17 4 0 

+ + + + + 

561 DATPIVVVDGKAKTINDDVK 

5 580 

1741 

GATTTCTTAGATGATTCACAAGTTGATATAATAGGTGGAGAAAACAGTGTATCTAAAGAT 18 00 

+ + + + + 

581 DFLDDSQVDI I GGENSVSKD 

10 600 

1801 

GTTGAAAATGCAATAGATGATGCTACAGGTAAATCTCCAGATAGATATAGTGGAGATGAT 18 60 

+ + + + + 

601 VENAI DDATGKS PDRYSGDD 

15 620 

1861 

AG AC AAGC AAC T AAT G C AAAAG T T AT AAAAG AAT C T T C T TAT TAT C AAG AT AAC T T AAAT 1920 

+ + + + + 

621 RQATNAKVI KES SYYQDNLN 

20 640 

1921 

AAT G AT AAAAAAG TAG T T AAT T T C T T T G T AGC T AAAG AT GG T T C T AC T AAAG AAG AT C AA 1980 
641 NDKKVVNFFVAKDGS TKEDQ 

25 660 

1981 

TTAGTTGATGCTTTAGCAGCAGCTCCAGTTGCAGCAAACTTTGGTGTAACTCTTAATTCT 204 0 

+ + + + + 

661 LVDALAAAPVAANFGVTLNS 

30 680 

2041 

GATGGTAAGCCAGTAGATAAAGATGGTAAAGtATTAACTGGTTCTGATAATGATAAAAAT 2100 

+ + + + + 

681 DGKPVDKDGKVLTGS DNDKN 

35 700 

2101 

AAATTAGTATCTCCAGCACCTATAGTATTAGCTACTGATTCTTTATCTTCAGATCaAAGT 2160 

+ + + + + 

701 KLVSPAPIVLATDSLSSDQS 

40 720 

2161 

GTATCTATAAGTAaAGTTCTTGATAAAGATAATGGAGAAAACTTAGTTCAAGTTGGTAAA 2220 
721 VS I SKVLDKDNGENLVQVGK 

45 740 

2221 G G TAT AGC T AC T T C AG T T AT AAAC AAAAT G AAAGAT T TAT TAG AT AT G 22 68 

+ + + + 

741 GIATSVINKMKDLLDM 756 



50 
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Appendix 2 

SEQ ID No. 4. Nucleotide sequence of slpA from Clostridium 
difficile strain 172450, PCR type 5, with translation. The 
5 putative secretory signal cleavage site (A) is indicated, and an 

approximation of the and site of cleavage to form the two mature 

SLPs (♦) is also indicated. 
1 

10 ATGAAAAAAAGAAATTTAGCAATGGCTATGGCAGCTGTTACTGTAGTAGGTTCTGCTGCT 60 

+ + + + + 

1 MKKRNLAMAMAAVTVVGSAA 

20 

61 

1 5 C C AG T T T T T G C AG C AGC T T C AG AT GT AAT AT C AC T AC AAG AT G G T AC AAAT G AT AAGT AT 120 

+ + + + + 

21 PVFAA AS DV I S L Q DG TN DKY 

40 

A 

20 121 

AC AG T ATCAAAT AC T AAAG C T AGT GAC T TAG T AAAG GAT AT T T T AGCAGC ACAAAAC T T A 180 

. + + + + + 

41 TVSNTKASDLVKDILAAQNL 

60 

25 181 

AC AAC AGG T GC AG T TAT T T T G AAC AAAG AT AC AAAAG T T AC T T T C TAT GAT G C AAAT GAG 24 0 

+ + + + + 

61TTGAVILNKDTKVTFYDANE 

80 

30 241 

AAAG AT T C T T C AAC T CC AAC T G G AG AT AAAAAAG T T TAT T C AG AAC AAAC T T T AAC T AC A 300 

+ + + + + 

81KDS STPTGDKKVYSEQTLTT 

100 

35 301 

GC T AAT G G AAAT G AAG AT TAT G T AAAG AC AAC T T T AAAAAAT T TAG AT G C AG GAG AAT AT 360 

+ + + + + 

101 ANGNEDYVKTTLKNLDAGEY 

120 

40 361 

GCTATTATAGATTTAACTTATAATAATGCTAAAACTGTTGAAATTAAAGTAGTAGCAGCT 4 20 

+ + + + + 

121 AI I DLTYNNAKTVEI KVVAA 

140 

45 421 

AG T G AAAAAAC AG TAG T T G TAT C TAG T GAT G CG AAAAAT AG T G C AAAAGAT AT AGC TG AA 4 80 

+ + + + + 

141 SEKTVVVSS DAKNSAKDIAE 

160 

50 

481 

AAAT AT G T G T T T GAAG AC AAAG AC T T AG AAAAT G C AC T AAAAAC TAT AAAT G C C T C AG AT 54 0 

+ + + + + 

161 KYVFEDKDLENALKT INASD 

55 180 
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541 

TTCAGTAAAACTGATAGTTACTATCAAGTAGTTCTTTATCCAAAAGGAAAGAGATTACAA 600 

+ + + + + 

181 FS KTDS YYQVVLYPKGKRLQ 

200 

601 

GGTTTCTCAACTTATAGAGCTACAAATTATAATGAAGGAACTGCATATGGTAATACACCA 660 

+ + + + + 

201 GFSTYRATNYNEGTAYGNTP 

220 

♦ 

661 

G T AAT AT T AAC T C T AAAAT C T AC TAG T AAG AG T AAT T T AAAG AC T G C AG T AG AAG AG T T A 720 

+ + + + + 

221 VILTLKSTSKSNLKTAVEEL 

240 

721 

C AAAAAT T G AAT G C TAG T TAT T C T AAT AC T AC AAC T T T AGC T GG T GAT G AC AG AAT AC AA 7 80 

+ + + + + 

241 QKLNAS YSNTTTLAGDDRI Q 

260 

781 

ACAGCTATAGAGATAAGTAAAGAATATTACAATAATGATGGCGAGAAATCAGATCATTCA 8 4 0 
261 TAIEISKEYYNNDGEKSDHS 

280 

841 

GCTGATGTTAAAGAGAATGTT AAAAAT GTTGTAT TAG TAGGTGCAAATGC ACT AGTAGAT 900 

+ + + + + 

281 ADVKENVKNVVLVGANALVD 

300 

901 

G GAT TAG TTGCGGCTCCTT TAG C AG C AG AAAAAG AT G C T CC AC TAT TAT T AAC T T C AAAA 9 60 
301 GLVAAPLAAEKDAPLLLTSK 

320 

961 

G AT AAAT TAG AT T C G T C AG T AAAAT C T G AAAT AAAG AG AG T T T TAG AC T T AAAAAC T T C A 1020 
+ + + + + 

321 DKLDSSVKSE I KRVLDLKT S 

340 

1021 

ACAGAAGTAACAGGAAAAACAGTTTATATAGCTGGTGGAGTTAATAGTGTATCTAAAGAA 1080 
341 TEVTGKTVYIAGGVNSVSKE 

360 

1081 

G T T G T AAC AG AAT T AGAAT C AAT G GG AT T AAAAG T T G AAAG AT T C T CAGGT G AT G AT AGA 114 0 
361 VVTELESMGLKVERFSGDDR 

380 

1141 

TATGAAACTTCTTTAAAAATAGCAGGTGAAATAGGCTTAGATAATGATAAGGCTTATGTA 1200 
+ + + + + 
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381 YETS LKIAGEIGLDNDKAYV 

400 

1201 

GTTGGTGGAACAGGATTAGCAGATGCCATGAGTATAGCTTCAGTTGCTTCTACTAAATTA 12 60 



401 VGGTGLADAMS IASVASTKL 

420 

1261 

G AT GG T AAT GG T G T T G T AG AT AG AAC AAAT G G AC AT GC T AC T C C AAT AG T T G T T G TAG AT 1320 



421 DGNGVVDRTNGHATPIVVVD 

440 

1321 

G G AAAAG C T G AT AAAAT AT C T GAT G AC T TAG AT AG T T T C T TAG G AAGCG C T GAT G TAG AT 1380 



441 GKADKISDDLDSFLGSADVD 

460 

1381 ATAATAGGTGGATTTGCAAGTGTATCTGAAAAGATGGAAGAAGCTATATCAGATGCTACT 
1440 



461 I IGGFASVSEKMEEAISDAT 

480 

1441 

G G T AAAG GC G T T AC AAG AG T T AAAG GC G AC GAT AG AC AAG AC AC T AAC T C T GAAG T TATA 1500 



481 GKGVTRVKGDDRQDTNSEVI 

500 

1501 

AAAACAT AT T AT GC T AAT GAT AC T G AAAT AG C T AAAGC T G C AG T T T T AGAT AAAG AT T C A 1560 



501KTYYAN DTEIAKAAVLDKDS 

520 

1561 

G G T G C T T C AAG TAGTGATG C AG GAG T AT T T AAT T T C TAT G T A G C T AAAG AT G G A T C T AC A 1620 



521 GAS S SDAGVFNFYVAKDGST 

540 

1621 

AAAG AAG AT C AAT TAG T T G AT GC AT TAG C AG TAG GAG CTGTTGCT GG AT AT AAAC T TGC T 168 0 



541 KEDQ LVDALAVGAVAGYKLA 

560 
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1681 

CCAGTTGTATTAGCTACTGATTCTTTATCTTCTGATCAATCGGTTGCTATAAGCAAAGTT 17 4 0 
561 PVVLATDSLSSDQSVAISKV 

580 

1741 

GTAGGAGAAAAATATTCTAAAGATTTAACACAAGTTGGTCAAGGAATAGCTAATTCAGTT 18 00 
581 VGEKYSKDLTQVGQGIANSV 

600 

1801 AT AAAC AAAAT GAAAG AT T TAT T AGAT AT G 1830 
601 INKMKDLLDM 610 
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Appendix 3 

SEQ ID No. 5. Nucleotide sequence of slpA from Clostridium 
difficile strain 170324, PCR type 12, with translation. The 
putative secretory signal cleavage site (A) and site of cleavage to 

form the two mature SLPs (♦) are indicated. 
1 

ATGAATAAGAAAAATATAGCAATAGCTATGTCAGGTTTAACAGTTTTAGCTTCGGCTGCT 
60 

+ + + + + 

1MNKKNIAIAMSGLTVLASAA 

20 

61 

CCTGTTTTTGCTGCAACTACTGGAACACAAGGTTATACTGTAGTTAAAAACGACTGGAAA 120 

+ + + + + 

21 PVFAATTGTQGYTVVKN DWK 

40 

A 

121 

AAAG C AG T AAAAC AAT T AC AAG AT G G AC T AAAAG AT AAT AGT AT AG G AAAG AT AAC T G T A 18 0 

+ + + + + 

41KAVKQLQDGLKDNS IGKITV 

60 

181 

TCTTTTAATGATGGGGTTGTGGGTGAAGTAGCTCCTAAAAGTGCTAATAAGAAAGCGGAC 2 4 0 

+ + + + + 

61 S FNDGVVGEVAPKSANKKAD 

80 

241 

AGAGATGCTGCAGCTGAGAAGTTATATAATCTTGTTAACACTCAATTAGATAAATTAGGT 300 
81RDAAAEKLYNLVNTQLDKLG 

100 

301 

G AT GG AGAT T AT GT TGAT T T T T CTGT AG AT TAT AAT T T AGAAAAC AAAAT AAT AACT AAT 3 60 

+ + + + + 

101 DGDYVDFSVDYNLENKIITN 

120 

361 

C AAG C AG AT G C AG AAG C AA T T G T T AC AAAG T T AAAT T C AC T T AAT GAG AAAAC T C T TAT T 4 2 0 
121 QADAEAIVTKLNSLNEKTL I 

140 

421 

GAT AT AG C AAC T AAAG AT AC T T T T G G AAT GG T TAG T AAAAC AC AAG AT AG T G AAG G T AAA 4 8 0 

+ + + + + 

141 DIATKDTFGMVSKTQDSEGK 

160 
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481 

AATGTTGCTGCAACAAAGGCACTTAAAGTTAAAGATGTTGCTACATTTGGTTTGAAGTCT 54 0 

+ + + + + 

161 NVAATKALKVKDVAT FGLKS 

180 

541 

G G T G G AAG C G AAG AT AC T G GAT AT G T T G T T G AAAT G AAAG C AGG AGC T G TAG AG G AT AAG 600 

+ + + + + 

181 GGS E DTGYVVEMKAGAVE DK 

200 

601 

TATGGTAAAGTTGGAGATAGTACGGCAGGTATTGCAATAAATCTTCCTAGTACTGGACTT 660 

+ + + + + 

201 YGKVGDSTAGIAINLPSTGL 

220 

661 

G AAT AT GC AGG T AAAG G AAC AAC AAT T GAT T T T AAT AAAAC T T T AAAAG T T GAT G T AAC A 720 

+ + + + + 

221 EYAG KGTT I DFNKTLKVDVT 

240 

721 

GGTGGTTCAACACCTAGTGCTGTAGCTGTAAGTGGTTTTGTAACTAAAGATGATACTGAT 780 

+ + + + + 

241 GGST PSAVAVSGFVTKDDTD 

260 

781 

T T AG C AAAA T C AG G T AC TAT AAAT G T AAG AG T TAT AAAT G C AAAAG AAG AAT C AAT T GAT 8 4 0 
261 LAKS GTINVRVINAKEES I D 

280 

841 

AT AG AT G C AAG C T CAT AT AC AT C AGC T G AAAAT T T AGC T AAAAG AT AT G TAT T T GAT CCA 900 

+ + + + + 

281 I DAS SYTSAENLAKRYVFDP 

300 

901 

GATGAAATTTCTGAAGCATATAAGGCAATAGTAGCATTACAAAATGATGGTATAGAGTCT 960 
301 DEI SEAYKAIVALQNDGIES 

320 

961 

AACTTAGTTCAGTTAGTTAATGGAAAATATCAAGTGATTTTTTATCCAGAAGGTAAAAGA 1020 
+ + + + + 

321 NLVQLVNGKYQVI FYPEGKR 

340 

1021 

T T AGAAAC T AAAT C AG C AAAT GAT AC AAT AG C TAG T CAAGAT AC ACCAGC T AAAG TAG T T 1080 
+ + + + + 

341 LETKSANDTIASQDTPAKVV 

360 

♦ 

1081 

ATAAAAGCTAATAAATTAAAAGATTTAAAAGATTATGTAGATGATTTAAAAACATATAAT 114 0 
+ + + + + 
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361 IKANKLKDLKDYVDDLKTYN 

380 

1141 

AAT AC T TAT T C AAAT G T T G T AAC AG TAG C AGG AG AAG AT AG AAT AG AAAC T G C T AT AG AA 1200 
5 + + + + + 

381 NTYSNVVTVAGEDRIETAIE 

400 

1201 

TTAAGTAGTAAATATTATAATTCTGATGATAAAAATGCAATAACTGATAAAGCAGTTAAT 12 60 

10 + + + + + - 

401 LSSKYYNS DDKNAI TDKAVN 

420 

1261 

GAT AT AG TAT TAG T T G GAT C T AC AT CT AT AG T T GAT GGTCTTGTT GC AT C AC CAT T AGC T 1320 

15 + + + + + 

421 DIVLVGSTSIVDGLVASPLA 

440 

1321 

T C AG AAAAAAC AGC T C CAT TAT TAT T AAC T T C AAAAG AT AAAT TAG AT T CAT C AG T AAAA 138 0 

20 + + + + + 

441 SEKTAPLLLTSKDKLDSSVK 

460 

1381 

T C T G AAAT AAAGAG AG T TAT G AAC T T AAAG AG T G AC AC T GG TAT AAAT AC T T C T AAAAAA 14 4 0 

25 + + + + + 

461 SEIKRVMNLKSDTGINTSKK 

480 

1441 

G T T TAT T T AGC T G G T GG AG T T AAT T C TAT AT C T AAAG AT G T AG AAAAT G AAT T G AAAAAC 1500 

30 + + + + + 

481 VYLAGGVNS I SKDVENELKN 

500 

1501 

AT G G GT C T T AAAG T T AC TAG AT TAT C AGG AG AAG AC AG AT AC G AAAC T T C T T TAG C AAT A 15 60 

35 + + + + + 

501 MGLKVTRLSGEDRYETSLAI 

520 

1561 

GCTGATGAAATAGGTCTTGATAATGATAAAGCATTTGTAGTTGGTGGTACTGGATTAGCA 1620 

40 + + + + + 

521 ADEI GLDN DKAFVVGGTGLA 

540 

1621 

GAT GC TAT GAG TAT AG C T C C AG TTGCTTCT CAAC T T AAAG AT G GAG AT GC T AC T C C AAT A 168 0 
45 + + + + + 

541 DAMS IAPVASQLK DGDAT P I 

560 

50 1681 

GTAGTTGTAGATGGAAAAGCAAAAGAAATAAGTGATGATGCTAAGAGTTTCTTAGGAACT 17 4 0 
+ + + + + 

561 VVVDGKAKEI SDDAKSFLGT 

580 

55 1741 

TCTGATGTTGATATAATAGGTGGAAAAAATAGCGTATCTAAAGAGATTGAAGAGTCAATA 1800 
+ + + + + 



WO 02/062379 



36 



PCT/IE02/00017 



581 SDVDIIGGKNSVSKEIEESI 

600 

1801 

GATAGTGCAACTGGAAAAACTCCAGATAGAATAAGTGGAGATGATAGACAAGCAACTAAT 18 60 

5 + + + + + 

601 DSATGKTPDRISGDDRQATN 

620 

1861 

GCTGAAGTTTTAAAAGAAGATGATTATTTCACAGATGGTGAAGTTGTGAATTACTTTGTT 1920 

10 + + + + + 

621 AEVLKEDDYFTDGEVVNYFV 

640 

1921 

GCAAAAGATGGTTCTACTAAAGAAGATCAATTAGTAGATGCCTTAGCAGCAGCACCAATA 1980 

15 + + + + + 

641 AKDGSTKEDQLVDALAAAPI 

660 

1981 

GCAGGTAGATTTAAGGAGTCTCCAGCTCCAATCATACTAGCTACTGATACTTTATCTTCT 204 0 
20 + + + + + 

661 AGRFKESPAPI ILATDTLSS 

680 

2041 

GACCAAAATGTAGCTGTAAGTAAAGCAGTTCCTAAAGATGGTGG7VACTAACTTAGTTCAA 2100 

25 + + + + + 

681 DQNVAVSKAVPKDGGTNLVQ 

700 

2101 GTAGGTAAAGGTATAGCTTCTTCAGTTATAAACA7\AATGAAAGATTTATTAGATATG 
2157 

30 + + + + + 

701 VGKG IASSVINKMKDLLDM 

719 



35 
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Appendix 4 

SEQ ID No 6. Nucleotide sequence of slpA from Clostridium 
difficile strain 171448, PGR type 12, with translation. The 
5 putative secretory signal cleavage site (A) and site of cleavage to 

form the two mature SLPs (♦) are indicated. 
1 

ATGAATAAGAAAAATATAGCAATAGCTATGTCAGGTTTAACAGTTTTAGCTTCGGCTGCT 60 

10 + + + + + 

1MNKKNIAIAMSGLTVLASAA 

20 

61 

CCTGTTTTTGCTGCAACTACTGGAACACAAGGTTATACTGTAGTTAAAAACGACTGGAAA 12 0 

15 + + + + + 

21PVFAATTGTQGYTVVKNDWK 

40 

A 

121 

20 AAAGCAGTAAAACAATTACAAGATGGACTAAAAGATAATAGTATAGGAAAGATAACTGTA 180 

+ + + + + 

41KAVKQLQDGLKDNSI GKITV 

60 

181 

25 TCTTTTAATGATGGGGTTGTGGGTGAAGTAGCTCCTAAAAGTGCTAATAAGAAAGCGGAC 24 0 

+ + + + + 

61 S FNDGVVGEVAPKSANKKAD 

80 

241 

30 AG AGAT G C T GC AG C T G AGAAG T T AT AT AAT C T T G T T AAC AC T C AAT T AG AT AAAT TAG G T 300 

+ + + + + 

81RDAAAEKLYNLVNTQLDKLG 

100 

301 

35 GAT G GAG AT TAT G T T GAT T T T T C T G TAG AT TAT AAT T T AG AAAAC AAAAT AAT AAC T AAT 360 

+ + + + + 

101 DGDYVDFSVDYNLENKIITN 

120 

361 

40 CAAGC AG AT GC AG AAG C AAT T G T T AC AAAG T T AAAT T C AC T T AAT GAG AAAAC T C T TAT T 420 

121 QADAEAIVTKLNSLNEKTLI 

140 

421 

45 GATATAGCAACTAAAGATACTTTTGGAATGGTTAGTAAAACACAAGATAGTGGAGGTAAA 4 8 0 

+ + + + + 

141 DIATKDTFGMVSKTQDSGGK 

160 
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481 

AATGTTGCTGCAACAAAGGCACTTAAAGTTAAAGATGTTGCTACATTTGGTTTGAAGTCT 54 0 

+ + + + + 

5 161 NVAATKALKVKDVATFGLKS 

180 

541 

GGTGGAAGCGAAGATACTGGATATGTTGTTGAAATGAAAGCAGGAGCTGTAGAGGATAAG 600 

+ + + + + 

10 181 GG S E D T GYVVE MKAGAVE DK 

200 

601 

TATGGTAAAGTTGGAGATAGTACGGCAGGTATTGCAATAAATCTTCCTAGTACTGGACTT 660 

+ + + + + 

15 201 YGKVGDSTAGIAINLPSTGL 

220 

661 

G AAT AT G C AG G T AAAGG AAC AAC AAT T GAT T T T AAT AAAAC T T T AAAAG T T GAT G T AAC A 720 

+ + + + + 

20 221 EYAGKGTTIDFNKTLKVDVT 

240 

721 

GGTGGTTCAACACCTAGTGCTGTAGCTGTAAGTGGTTTTGTAACTAAAGATGATACTGAT 7 80 

+ + + + + 

25 241 GGST PSAVAVSGFVTKDDTD 

260 

781 

T TAG C AAAAT C AG G T AC TAT AAA T G T AAG AG T T AT AAAT G C AAAAG AAG AAT C AAT T GAT 8 4 0 

+ + + + + 

30 261 LAKSGT INVRVINAKEES I D 

280 

841 

AT AG AT G C AAG C T CAT AT AC AT C AG C T G AAAAT T T AGC T AAAAGAT AT G T AT T T GAT C C A 900 

+ + + + + 

35 281 IDASSYTSAENLAKRYVFDP 

300 

901 

GATGAAATTTCTGAAGCATATAAGGCAATAGTAGCATTACAAAATGATGGTATAGAGTCT 960 

+ + + + + 

40 301 DEI SEAYKAIVALQNDGIES 

320 

961 

AATTTAGTTCAGTTAGTTAATGGAAAATATCAAGTGATTTTTTATCCAGAAGGTAAAAGA 1020 

+ + + + + 

45 321 NLVQLVNGKYQVI FYPEGKR 

340 

1021 

TTAGAAACTAAATCAGCAAATGATACAATAGCTAGTCAAGATACACCAGCTAAAGTAGTT 1080 

+ + + + + 

50 341 LETKSANDTIASQDTPAKVV 

360 

♦ 

1081 

55 AT AAAAGC T AAT AAAT T AAAAG AT T T AAAAGAT TAT G TAG AT GAT T T AAAAAC AT AT AAT 114 0 
+ + + + + 
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10 



15 



361 I KANKLKDLKDYVDDLKTYN 

380 

1141 

AAT AC T TAT T C AAAT G T T G T AAC AG T AG C AG G AG AAG AT AG AAT AG AAAC T G C T AT AG AA 1200 
+ + + + + 

381 NTYSNVVTVAGEDRI ETAI E 

400 

1201 

TTAAGTAGTAAATATTATAATTCTGATGATAAAAATGCAATAACTGATAAAGCAGTTAAT 12 60 
+ + + + + 

401 LSSKYYNSDDKNAITDKAVN 

420 

1261 

GAT AT AG TAT TAG T T G G AT C T AC AT C TAT AG T T GAT GGTCTTGTTG CAT CAC CAT TAG C T 1320 



421 DIVLVGSTSIVDGLVASPLA 

440 

1321 

T C AG AAAAAAC AG C T C CAT TAT TAT T AGC T T C AAAAG AT AAAT TAG AT T CAT C AG T AAAA 138 0 

20 + — + + + + 

441 SEKTAPLLLASKDKLDSSVK 

460 

1381 

T C T G AAAT AAAG AG AG T TAT G AAC T T AAAG AGT G AC AC T G G TAT AAAT AC T T C T AAAAAA 14 4 0 

25 + + + + + 

461 SEI KRVMNLKSDTGINTSKK 

480 

1441 

GTTTATTTAGCTGGTGGAGTTAATTCTATATCTAAAGATGTAGAAAATGAATTGAAAAAC 1500 

30 + + + + + 

481 VYLAGGVNS I SKDVENELKN 

500 

1501 

AT GG G T C T T AAAG T T AC TAG AT T ATC AG GAG AAG AC AG AT AC G AAAC T T C T T TAG CAATA 15 60 

35 + + + + + 

501 MGLKVTRLSGEDRYETSLAI 

520 

1561 

G C T GAT G AAAT AG G T C T T GAT AAT G AT AAAGCAT T T G TAG T T G G T GG T AC T G GAT T AGCA 1620 

40 + + + + + 

521 ADE IGLDNDKAFVVGGTGLA 

540 

1621 

GAT G C T AT GAG TAT AG C T C CAG TTGCTTCT CAAC T T AAAG AT G G AG AT GC T AC T C CAATA 168 0 

45 + + + + + 

541 DAMSIAPVASQLKDGDATPI 

560 

50 1681 

G TAG T T G TAG AT G G AAAAG C AAAAG AAAT AAG T GAT G AT GC T AAG AG T T T C T T AGG AAC T 17 4 0 

-; + + + + + 

561 VVVDGKAKEISDDAKSFLGT 

580 

55 1741 

TCTGATGTTGATATAATAGGTGGAAAAAATAGCGTATCTAAAGAGATTGAAGAGTCAATA 18 00 
+ + + + + 
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581 SDVDI IGGKNSVSKEIEESI 

600 

1801 

GATAGTGCAACTGGAAAAACTCCAGATAGAATAAGTGGAGATGATAGACAAGCAACTAAT 18 60 
601 DSATGKTPDRI SGDDRQATN 

620 

1861 

GCTGAAGTTTTAAAAGAAGATGATTATTTCACAGATGGTGAAGTTGTGAATTACTTTGTT 1920 
621 AEVLKEDDYFTDGEVVNYFV 

640 

1921 

GCAAAAGATGGTTCTACTAAAGAAGATCAATTAGTAGATGCCTTAGCAGCAGCACCAATA 1980 
641 AKDGS TKE DQLVDALAAAP I 

660 

1981 

GC AGG T AG AT T T AAGGAG T C T C C AG C T C CAAT CAT AC T AGC T AC T GAT AC T T TAT C T T C T 204 0 
661 AGRFKESPAPI ILATDTLSS 

680 

2041 

G AC C AAAAT G T AG C T G T AAG T AAAG C AG T T C C T AAAG AT GG T GG AAC T AAC T TAG T T C AA 2100 
25 + + + + + 

681 DQNVAVSKAVPKDGGTNLVQ 

700 

2101 G T AGG T AAAG G TAT AG C T T C T T C AG T TAT AAAC AAAAT GAAAG AT T TAT TAG AT AT G 
2157 

701 VGKGIASSVINKMKDLLDM 

719 



35 
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Appendix 5 

SEQ ID No. 7. Nucleotide sequence of slpA from Clostridium 
difficile strain 171862, PCR type 17, with translation. The 
5 putative secretory signal cleavage site (A) and site of cleavage to 

form the two mature SLPs (♦) are indicated. 

1 

ATGAATAAGAAAAACTTAGCAATGGCTATGGCAGCAGTTACTGTTGTGGGTTCTGCAGCG 60 

10 + + + + + 

1MNKKNLAMAMAAVTVVGSAA 

20 

61 

CCAATATTTGCAGATAGTACTACGCCAGGTTATACTGTAGTGAAAAATGATTGGAAAAAA 120 
15 + + + + + 

21PIFADSTTPGYTVVKNDWKK 

40 

A 

121 

20 G C AG T AAAAC AAT T AC AAG AT G GG T T G AAAAAT AAAAC TAT AT C AAC AAT AAAG G T G T C T 18 0 

+ + + + + 

41AVKQLQDGLKNKT ISTIKVS 

60 

181 

25 T T T AAT GG AAAC T C T G T T GGAG AAG T T AC AC CAGCCAGT T C T G GAG C AAAAAAAG C AGAT 24 0 

+ + + + + 

61 FNGNSVGEVTPAS SGAKKAD 

80 

241 

30 AG AG AT G C T G C AG C T G AAAAG T TAT AT AAT T TAG T AAAT AC AC AAT TAG AT AAAC T AGG T 300 

+ + + + + 

81 RDAAAEKLYNLVNTQLDKLG 

100 

301 

35 GATGGAGATTACGTTGACTTTGAAGTAACTTATAATTTAGCTACTCAAATAATTACAAAA 360 

+ + + + + 

101 DGDYVDFEVTYNLATQII TK 

120 

361 

40 GC AG AAGC AG AGG C AGT T C T T AC AAAAT T AC AAC AAT AT AAT GAT AAAG T AC T TAT AAAT 4 20 

+ + + + + 

121 AEAEAVLTKLQQYNDKVLIN 

140 

421 

45 T CT GC AAC AGAT AC AGT AAAAG G T AT GG TAT C T GAT AC AC AAG T T GAT AGC AAAAAT GT T 4 8 0 

+ + + + + 

141 SATDTVKGMVS DTQVDSKNV 

160 
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481 

G C AG C T AAC C C AC T T AAAG T TAG T GAT AT G T AT AC AAT AC CAT C T G C TAT TAG T G G AAG T 54 0 

+ + + + + 

161 AANPLKVS DMYTI PSAI TGS 

180 

541 

GATGATTCTGGGTATAGTATTGCTAAACCAACAGAAAAGACTACAaGTTTATTGTATGGT 600 

+ + + + + 

181 DDSGYSIAKPTEKTTSLLYG 

200 

601 

ACGGTTGGTGATGCAACTGCAGGTAAAGCAATAACAGTAGATACAGCTTCAAATGAAGCT 660 
201 TVGDATAGKAITVDTASNEA 

220 

661 

TTTGCTGGAAATGGAAAGGTTATTGACTACAATAAATCATTCAAAGCAACTGTACAAGGA 7 20 
221 FAGNGKVI DYNKS FKATVQG 

240 

721 

GATGGAACAGTTAAGACAAGCGGGGTTGTACTTAAAGATGCAAGTGATATGGCTGCAACA 7 80 
241 DGTVKTSGVVLKDAS DMAAT 

260 

781 

GGTACTATAAAAGTTAGAGTTACAAGTGCAAAAGAAGAATCTATTGATGTGGATTCAAGT 8 4 0 
261 GTI KVRVT SAKEES I DVDSS 

280 

841 

T CAT AT AT TAG T G C T G AAAAT T TAG C T AAAAAAT AT G TAT T T AAT C C T AAAG AG G T T T C T 900 
281 SYI SAENLAK KYVFNPKEVS 

300 

901 

G AAG C T TAT AAT G CAAT AG T T GC AT T AC AAAAT GAT G G AAT AG AAT C T GAT T TAG T AC AA 960 

+ + + + + 

301 EAYNAIVALQNDGI ESDLVQ 

320 

961 

T TAG T T AAT G G AAAAT AT C AAG T TAT T T T C TAT C C AG AAG G AAAAAGAT T AGAAAC T AAA 1020 
321 LVNGKYQV I FYPEGKRLETK 

340 
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1021 TCTGCAGATATAATAGCTGATGCAGATAGTCCAGCTAAAATAACTATAAAAGCTAATAAA 
1080 

341 SAD I IADADS PAKIT IKANK 

360 

♦ 

1081 

T T AAAAG A T T T AAAAG AT TAT G TAG AT GAT T T AAAAAC AT AC AAT AAT AC T T AC T C AAAT 114 0 
361 LKDLKDYVDDLKT YNNTYSN 

380 

1141 

GTTGTAACAGTAGCAGGAGAAGATAGAATAGAAACTGCTATAGAATTAAGTAGTAAATAT 1200 

+ + + + + 

381 VVTVAGEDRIETAIELSSKY 

400 

1201 

TATAATTCTGATGATAAAAATGCAATAACTGATGATGCAGTTAATAATATAGTATTAGTT 12 60 
401 YNS DDKNAIT DDAVNNIVLV 

420 

1261 

GGATCTACATCTATAGTTGATGGTCTTGTTGCATCACCATTAGCTTCAGAAAAAACAGCT 1320 

+ + + + + 

421 GSTSIVDGLVASPLASEKTA 

440 

1321 

CC AT TAT TAT T AAC T T C AAAAG AT AAAT TAG AT T CAT C AG T AAAAT C T G AG AT AAAAAG A 1380 
441 PLLLTSKDKLDSSVKSEIKR 

460 

1381 

GTTATGAACTTAAAGAGTGATACTGGTATAAATACTTCTAAAAAAGTTTATTTAGCTGGT 14 4 0 

+ + + + + 

461 VMNLKSDTGINTS KKVYLAG 

480 

1441 

GGAGTTAATTCTATATCTAAAGATGTAGAAGATGAATTGAAAAATATGGGCCTTAAAGTT 1500 

+ + + + + 

481 GVN S I SKDVEDELKNMGLKV 

500 

1501 

ACTAGATTATCAGGAGAAGACAGATACGAAACTTCTTTAGCAATAGCTGATGAAATAGGT 1560 



501 TRLSGEDRYETSLAIADEIG 

520 

1561 

CTTGATAATGATAAAGCATTTGTAGTTGGTGGTACTGGATTGGCAGATGCTATGAGTATA 1620 
521 LDN DKAFVVGGTGLADAMS I 

540 



1621 

GCTCCAGTTGCTTCTCAACTTAAAGATGGAGATGCTACTCCAATAGTAGTTGTAGATGGA 1680 
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541 APVASQLKDGDATPIVVVDG 

560 

1681 

AAAGCAAAAGAAATAAGTGATGATGCTAAGAGTTTCTTAGGAACTTCTGATGTTGATATA 17 4 0 
561 KAKE I SDDAKS FLGTSDVDI 

580 

1741 

AT AGG T G G AAAAAAT AG C G T AT C T AAAG AG AT T G AAG AG T C AAT AG AT AG T G C AAC T GG A 18 00 
581 IGGKNSVSKEIEESIDSATG 

600 

1801 

AAAAC T CC AG AT AG AAT AAG T G GAG AT G AC AG AC AAG C AAC T AAT GC T G AAG T T T T AAAA 18 60 
601 KTPDRI SGDDRQATNAEVLK 

620 

1861 

GAAGATGATTATTTCAAAGATGGTGAAGTTGTGAATTACTTTGTTGCAAAAGATGGTTCT 1920 
621 EDDY FKDGEVVNYFVAKDGS 

640 

1921 

ACTAAAGAAGATCAATTAGTAGATGCATTAGCAGCAGCACCAATAGCAGGTAGATTTAAG 1980 
641 TKEDQLVDALAAAP IAGRFK 

660 

1981 

GAGTCTCCAGCTCCAATCATACTAGCTACTGATACTTTATCTTCTGACCAAAATGTAGCT 204 0 
661 ESPAPI ILATDTLSSDQNVA 

680 

2041 

GTAAGTAAAGCAGTTCCTAAAGATGGTGGAACTAACTTAGTTCAAGTAGGTAAAGGTATA 2100 
681 VSKAVPKDGGTNLVQVGKG I 

700 

2101 GCTTCTTCAGTTATAAACAAAATGAAAGATTTATTAGATATGTAA 214 5 
40 701 ASSVINKMKDLLDM* 715 
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Appendix 6 

SEQ ID No 8. Nucleotide sequence of slpA from Clostridium 
difficile strain 173644, PCR type 31, with translation. The 
putative secretory signal cleavage site (A) and site of cleavage to 

form the two mature SLPs (♦) are indicated. 
1 

AT G AAT AAG AAG GAT AT AGC AAT AG C TAT G T C AG GAT T AACAG TAT TAG C T T C T G CAGC A 60 

+ + + + + 

1MNKKDIAIAMSGLTVLASAA 

20 

61 

CCTGTATTTGCTGCTAGTAGTTTTACAGCAGATTATAATTATACTGTAGTGCAAGGAAAA 120 

+ + + + + 

21 P V F A A S S FTADYNYTVVQGK 

40 

A 

121 

TAT C AAAAAG T TAT AAC T GG AT T AC AAG AT G G T T T AAAAAATG G AAAAAT AAC AAAT AT T 18 0 

+ + + + + 

41 YQK VITGLQDGLKNGKI TNI 

60 

181 

GATGTAATATTTGATGGAAGTTCAATTGGTGAGGTAGTGCCAGGTTCTGATGCTGCAGCT 24 0 
61DVI FDGS SIGEVVPGSDAAA 

80 

241 

GCAGCTACTAAATTAAAAAGTTTAGTTGATGATAAGTTAGATAACTTAGGTGATGGAAAA 300 

+ + + + + 

81 AAT KLKSLVDDKLDNLGDGK 

100 

301 

TACGTTCAATTTAATGTTACTTATACTACTAAATCTATAATAACTAAAGCAGAATTAAAA 360 
101 YVQ FNVTYTTKSI ITKAELK 

120 

361 

AATTATTATAATCAATTAGAAAGTAGTAAAGATAGAATACTTATAGGAAATGAACCTCAA 4 20 

+ + + + + 

121 NYYNQLESSKDRILIGNEPQ 

140 

421 

GATACAGGAACTAAAGGTCTTATAAAAGCTGATACTGATGGTACTACTGCTGTTGCAGCA 4 8 0 

+ + + + + 

141 DTGTKGLIKADTDGTTAVAA 

160 



481 

GC T GC AC CAT T G AAAT TAT C AGAT AT AT T T AC G T T TAG T TAT GAT G AAG T AAC AGG T G T A 54 0 

+ + + + + 

161 AAPLKLSDIFTFSYDEVTGV 

180 



WO 02/062379 



46 



PCT/IE02/00017 



541 

CTTAAAGCAGAACCAACAAGTAAAGTAAGCGCTGGTAAAGTTCAAGGTCTAAAATATGGA 600 

+ + + + + 

181 LKAE PTSKVSAGKVQGLKYG 

5 200 

601 

AATACAGGAGCAACTAACTATACTTCTGGAGCTGAAATATCTGTTCCTACTACAGGCTTA 660 

1+ + + + + 

201 NTGATNYTSGAEI SVPTTGL 

10 220 

661 

ACATTAACTGCTGATACAACTGCAACAACAGATGTAAATATTTCTGATGTTATGAGTGCA 72 0 

+ + + + + 

221 TLTAD TTATT DVN I S DVMSA 

15 240 

721 

TTTAAATTTAATGGTACTGATACGATTAGTGGATTCCCAGCTGGTTCATCAGCTTCTACT 7 8 0 

+ + + + + 

241 FKFNGTDTISGFPAGSSAST 

20 2 60 

781 

CTTAGAGCAAGTATAAAAGTAATAAATGCAAAAGAAGAATCTATAGATGTTGATTCAAGT 8 4 0 

+ + + + + 

261 LRAS I KVINAKEESI DVDS S 

25 280 

841 

T C AC AT AG AAC AG C T G AAG AT T TAG C T GAAAAAT AT G T AT T T AAAC C AGAAGAT G T GAAT 900 

+ + + + + 

281 SHRTAEDLAEKYVFKPEDVN 

30 300 

901 

AAAAC T TAT GAG G C AC T G AC T GAT T T AT AT AAAG AAG G TAT AAC AAG T AAT C T TAT C AC T 960 

+ + + + + 

301 KTYEALTDLYKEGITSNLIT 

35 320 

961 

C AAGAT G G T GG AAAAT AT C AAGT T G T T T T AT T T GC T C AAG GAAAG AGAT T AAC T AC T AAA 1020 
+ + + + + 

321 QDGGKYQVVLFAQGKRLTTK 

40 340 

1021 

G GAG C AAC T GG AAC T T TAG C AG AT G AAAAT TCTCCTCT T AAAG T AAC AAT AAAAG C AG AT 1080 
+ + + + + 

341 GATGTLADENS PLKVTI KAD 

45 360 

1081 

AAAG T AAAAG AC T T AAAAG AT TAT G T T G AAG AT T T AAAAAAT G C T AAC AAT G GAT AT T C A 114 0 

50 + + + + + 

361 KVKDLKDYVEDLKNANNGYS 

380 

1141 

AATTCTGTTGTTGTAGCAGGTGAAGATAGAATAGAAACAGCAATAGAGTTAAGTAGCAAA 12 00 

55 + + + + + 
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381 NSVVVAGEDRI ETAI ELSSK 

400 

1201 

TACTATAACTCTGATGATGACAATGCAATAACTAAAGATCCAGTTAACAATGTTGTTTTA 1260 

5 + + + + + 

401 YYNS DDDNAITKDPVNNVVL 

420 

1261 

GTTGGTTCTCAAGCTGTAGTTGATGGGCTTGTAGCTTCACCTTTAGCATCTGAAAAAAGA 1320 

10 + + + + + 

421 VGSQAVVDGLVAS PLAS EKR 

440 

1321 

GCTCCTTTACTATTAACTTCAGCAGGAAAATTAGATTCAAGTGTTAAAGCTGAGTTGAAA 138 0 

15 + + + + + 

441 APLLLTSAGKLDSSVKAELK 

460 

1381 

AGAGTAATGGATTTAAAATCTACAACAGGTGTAAATACTTCTAAAAAAGTTTACTTAGCT 14 4 0 

20 + + + + + 

461 RVMDLKSTTGVNTSKKVYLA 

480 

1441 

GGTGGAGTAAACTCTATATCTAAAGATGTAGAAAATGAATTAAAAGATATGGGACTTAAA 1500 

25 + + + + + 

481 GGVNS ISKDVENELKDMGLK 

500 

1501 

GTTACAAGATTATCAGGAGATGATAGATATGAAACTTCTTTAGCTATAGCTGATGAAATA 15 60 

30 + + + + + 

501VTR. LSGDDRYETSLAIADEI 

520 

1561 

G G T C T T G AT AAT G AT AAAG C T T T T G T AG T T G GAG G AAC AG GAT T AG CGG AT G C TAT GAG T 1620 

35 + + + + + 

521 GLDNDKAFVVGGTGLADAMS 

540 

1621 

AT AG C T C C AG TTGCTTCT CAAT T AAG AAAC T C AAAT GG AG AAC T T G AC T T AAAAGGT GAT 1680 

40 + + + + + 

541 IAPVASQLRNSNGELDLKGD 

560 

45 1681 

GCAACTCCAATAGTAGTTGTTGATGGAAAAGCTAAAGATATAAATTCTGAAGTAAAAGAT 17 4 0 

+ + + + + 

561 ATPIVVVDGKAKDINSEVKD 

580 

50 1741 

TTCTTAGATGATTCACAAGTTGATATAATAGGTGGTGTAAATAGTGTTTCTAAAGAAGTA 1800 

+ + + + + 

581 FLDDSQVDI I GGVNSVS KEV 

600 

55 1801 

ATGGAAGCAATAGATGATGCTACTGGAAAATCACCTGAGAGATATAGTGGAGAAGATAGA 18 60 
+ + + + + 
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601 MEAI DDATGKS PERYSGEDR 

620 

1861 

CAAGCTVACAAATGCTAAAGTTATAAAAGAAGATGATTTCTTTAZVAAATGGAGAAGTTACA 1920 

5 + + + + + 

621 QAT NAKVI KEDDFFKNGEVT 

640 

1921 

AACTTCTTTGTAGCTAAAGATGGTTCAACTAAAGAAGATCAATTAGTAGATGCTTTAGCA 1980 

10 + + + + + 

641 N FFVAKDGSTKE DQLVDALA 

660 

1981 

GGTGCTGCAATTGCTGGTAACTTTGGTGTAACAGTAGATAATGAAGGAAAACCTACAGTT 204 0 

15 + + + + + 

661 GAAIAGNFGVTVDNEGKPTV 

680 

2041 

GCTGATAAAAAAGCTTCTCCAGCACCAATTGTTTTAGCAACAGATTCTTTATCTTCTGAT 2100 

20 + + + + + 

681 ADKKASPAPIVLATDSLSSD 

700 

2101 

CAAAATGTAGCTATAAGTATVAGCTGTAAATGATGACGCTAATACTAAGAATCTAGTTCAA 2160 

25 + + + + + 

701 QNVAI SKAVNDDANTKNLVQ 

720 

2161 GTTGGTAAAGGTATAGCTACTTCAGTTGTAAGTAAAATAAAAGATTTATTAGATATG 
2217 

30 + + + + + 

721 VGKGIATSVVSKIKDLLDM 

739 



35 
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Appendix 7 

SEQ ID No 9. Nucleotide sequence of slpA from Clostridium difficile 
strain 170444, PCR type 46, with translation. The putative 
5 secretory signal cleavage site (A) and site of cleavage to form the 

two mature SLPs (♦) are indicated. 
1 

ATGAATAAGAAAAATATAGCAATAGCTATGTCAGGTTTAACAGTTTTAGCTTCGGCTGCT 60 

10 + + + + + 

1MNKKN IAIAMSGLTVLASAA 

20 

61 

CCTGTTTTTGCTGCAACTACTGGAACACAAGGTTATACTGTAGTTAAAAACGACTGGAAA 12 0 

15 + + + + + 

21 PVFAATTGTQGYTVVKNDWK 

40 

A 

121 

20 AAAGCAGTAAAACAATTACAAGATGGACTAAAAGATAATAGTATAGGAAAGATAACTGTA 18 0 

+ + + + + 

41KAVK QLQDGLKDNS I GKI TV 

60 

181 

25 TCTTTTAATGATGGGGTTGTGGGTGAAGTAGCTCCTAAAAGTGCTAATAAGAAAGCGGAC 2 4 0 

+ + + + + 

61SFNDGVVGEVAPKSANKKAD 

80 

241 

30 AGAGATGCTGCAGCTGAGAAGTTATATAATCTTGTTAACACTCAATTAGATAAATTAGGT 300 

+ + + + + 

81RDAAAEKLYNLVNTQLDKLG 

100 

301 

3 5 GAT GG AG AT TAT G T T GAT T T T T C T G T AGAT T AT AAT T T AGAAAAAAAAAT AAT AAC T AAT 360 

+ + + + + 

101 DGDYVDFSVDYNLEKKIITN 

120 

361 

40 CAAGCAGATGCAGAAGCAATTGTTACAAAGTTAAATTCACTTAATGAGAAAACTCTTATT 4 20 

+ + + + + 

121 QADAEAIVTKLN SLNEKTLI 

140 

421 

45 GATATAGCAACTAAAGATACTTTTGGAATGGTTAGTAAAACACAAGATAGTGAAGGTAAA 4 8 0 

+ + + + + 

141 DIATKDTFGMVS KTQDSEGK 

160 
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481 

AATGTTGCTGCAACAAAGGCACTTAAAGTTAAAGATGTTGCTACATTTGGTTTGAAGTCT 540 

+ + + + + 

5 161NVAATKALKVKDVATFGLKS 

1 80 

541 

GGTGGAAGCGAAGAT ACTGGAT ATGTT ATTGAAATGAAAGCAGGAGCTGTAGAGGAT AAG 600 

+ + + + + 

JO 181 GGSEDTGYVIEMKAGAVEDK 

200 

601 

TATGGTAAAGTTGGAGATAGT ACGGCAGGTATTGCAAT AAATCTTCCTAGT ACTGGACTT 660 

+ + + + + 

15 201 YGKVGDSTAGIAI NLPSTGL 

220 

661 

GAATATGCAGGTAAAGGAACAACAATTGATTTTAATAAAACTTTAAAAGTTGATGTAACA 7 2 0 

+ + + + + 

20 221 EYAGKGTTIDFNKTLKVDVT 

240 

721 

GGTGGTTCAACACCTAGTGCTGTAGCTGTAAGTGGTTTTGTAACTAAAGATGAT ACTGAT 780 

25 241 GGSTPSAVAVSGFVTKDDTD 

260 

781 

TTAGCAAAATCAGGTACTATAAATGTAAGAGTTATAAATGCAAAAGAAGAATCAATTGAT 84 0 

+ + + + + 

30 261 LAKSGTINVRVINAKEESID 

280 

841 

ATAGATGCAAGCTCATAT ACATCAGCTGAAAATTTAGCTAAAAGACATGT ATTTGATCCA 90 0 

+ + + + + 

35 281IDASSYTSAENLAKRHVFDP 

300 

901 

GATGAAATTTCTGAAGCATAT AAGGCAAT AGTAGCATT ACAAAATGATGGT AT AGAGTCT 96 0 

40 301 DE I SEAYKAIVALQNDGIE S 

320 

961 

AATTT AGTTCAGTTAGTTAATGGAAAATATCAAGTGATTTTTTATCCAGAAGGTAAAAGA 10 20 

45 321 NLVQLVNGKYQVI FYPEGKR 

340 
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1021 T T AGAAAC T AAAT C AGC AAAT G AT ACAAT AG C T AGT C AAG AT AC AC C AG C T AAAG T AG T T 
1080 

+ + + + + 

341 LETKSANDT IAS QDT PAKVV 

360 

♦ 

1081 

AT AAAAGCT AAT AAAT T AAAAG AT T T AAAAG AT T ATGT AGAT GAT T T AAAAACAT AT AAT 114 0 
+ + + + + 

361 I KANKLKDLKDYVDDLKTYN 

380 

1141 

AATACTTATTCAAATGTTGTAACAGTAGCAGGAGAAGATAGAATAGAAACTGCTATAGAA 1200 
+ + + + + 

381 NTYSNVVTVAGEDRIETAIE 

400 

1201 

TTAAGTAGTAAATATTATAATTCTGATGATAAAAATGCAATAACTGATAAAGCAGTTAAT 12 60 
+ + + + + 

401 LSSKYYNSDDKNAI TDKAVN 

420 

1261 

GATATAGTATTAGTTGGATCTACATCTATAGTTGATGGTCTTGTTGCATCACCATTAGCT 132 0 
4. + + + + 

421 DIVLVGSTSIVDGLVASPLA 

440 

1321 

T C AGAAAAAAC AGC T C CAT TAT TAT T AAC T T C AAAAGAT AAAT TAG AT T CAT C AG T AAAA 1380 

+ + + + 4- 

441SEKTAPLLLTSKDK LDSSVK 

460 

1381 

T C T G AAA T AAAG AG AG T TAT G AAC T T AAAG AG T G AC AC T G GT AT AAAT AC T T C T AAAAAA 14 4 0 

+ + + + + 

461 SEIKRVMNLKSDTGINTSKK 

480 

1441 

GTTTATTTAGCTGGTGGAGTTAATTCTATATCTAAAGATGTAGAAAATGAATTGAAAAAC 1500 

4- + + + + 

481 VYLAGGVNS I SKDVENELKN 

500 

1501 

ATGGGTCTTAAAGTTACTAGATTATCAGGAGAAGACAGATACGAAACTTCTTTAGCAATA 1560 
+ + + + + 

501 MGLKVTRLSGEDRYETSLAI 

520 

1561 

GCTGATGAAATAGGTCTTGATAATGATAAAGCATTTGTAGTTGGTGGTACTGGATTAGCA 1620 
+ + + + + 

521 ADEI GLDNDKAFVVGGTGLA 

540 



1621 

GATGCTATGAGTATAGCTCCAGTTGCTTCTCAACTTAAAGATGGAGATGCTACTCCAATA 168 0 
+ + + + + 
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541 DAMS IAPVASQLKDGDATPI 

560 

1681 

GTAGTTGTAGATGGAAAAGCAAAAGAAATAAGTGATGATGCTAAGAGTTTCTTAGGAACT 174 0 

5 + + + + + 

561 VVVDGKAKEI S DDAKSFLGT 

580 

1741 

T C T GAT GT T G AT AT AAT AG G T G G AAAAAAT AG C G TAT C T AAAG AGAT T G AAG AG T C AAT A 1800 

10 + + + + + 

581 SDVDIIGGKNSVSKEIEESI 

600 

1801 

GAT AG T GC AAC T GG AAAAAC T C C AG AT AG AAT AAG T GG AG AT GAT AG AC AAG C AAC T AAT 18 60 

15 + + + + + 

601 DSATGKTPDRI SGDDRQATN 

620 

1861 

GC T G AAG T T T T AAAAG AAG AT GAT TAT T T C AC AG AT G G T G AAG T T G T G AAT T AC T T T G T T 1920 
621 AEVLKEDDYFT DGEVVNYFV 

640 

1921 

GCAAAAGATGGTTCTACTAAAGAAGATCAATTAGTAGATGCCTTAGCAGCAGCACCAATA 1980 
25 + + + + + 

641 AKDGSTKEDQLVDALAAAPI 

660 

1981 

GCAGGT AGAT TTAAGGAGTCTCCAGCTCCAATCAT ACT AGCT ACT GAT ACT TTATCTTCT 20 40 

30 + + + + + 

661 AGRFKESPAPI ILATDTLSS 

680 

2041 

GACCAAAATGTAGCTGT AAG TAAAGCAGTTCCT AAAG ATGGTGGAACTAACTTAGTTCAA 2100 

35 + + + + + 

681 DQNVAVSKAVPKDGGTNLVQ 

700 

2101 GTAGGTAAAGGTATAGCTTCTTCAGTTATAAACAAAATGAAAGATTTATTAGATATG 
2157 

40 + + + + + 

701 VGKGIASSVINKMKDLLDM 

719 



45 
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Appendix 8 

SEQ ID No 10. Nucleotide sequence of slpA from Clostridium difficile 
strain 170426, PCR type 92, with translation. The putative 
5 secretory signal cleavage site (A) and site of cleavage to form the 

two mature SLPs (♦) are indicated. 
1 

ATGAATAAGAAAAATATAGCAATAGCTATGTCAGGTTTAACAGTTTTAGCTTCGGCTGCT 60 

1MNKKNIAIAMSGLTVLASAA 

20 

61 

CCTGTTTTTGCTGCAACTACTGGAACACAAGGTTATACTGTAGTTAAAAACGACTGGAAA 12 0 
21 PVFAATTGTQGYTVVKNDWK 

40 

A 

121 

20 AAAG C AG T AAAAC AAT T AC AGG AT G G AC T AAAAG AT AAT AG T AT AGG AAAG AT AAC T G T A 180 

41 KAV. KQLQDGLKDNS I GKITV 

60 

181 

25 TCTTTTAATGATGGGGTTGTGGGTGAAGTAGCTCCTAAAAGTGCTAATAAGAAAGCGGAC 24 0 

+ + + + + 

61 SFNDGVVGEVAPKSANKKAD 

80 

241 

30 AGAGATGCTGCAGCTGAGAAGTTATATAATCTTGTTAACACTCAATTAGATAAATTAGGT 300 

+ + + + + 

81 RDAAAEKLYNLVNTQLDKLG 

100 

301 

35 GAT G GAG AT TAT G T T GAT T T T T C T G TAG AT TAT AAT T T AG AAAAAAAAAT AAT AAC T AAT 360 

+ + + + + 

101 DGDYVDFSVDYNLEKKI ITN 

120 

361 

40 C AAG CAG AT GC AGAAG C AAT T G T T AC AAAG T T AAAT T C AC T T AAT GAG AAAAC T C T TAT T 420 

+ + + + + 

121 QADAEAIVTKL NSLNEKTLI 

140 

421 

45 GAT AT AG C AAC T AAAG AT AC T T T T GG AAT G G T TAG T AAAAC AC AAG AT AG T G AAG G T AAA 4 8 0 

+ + + + + 

+ 

141 DIATKDTFGMVSKTQDSEGK 

160 

50 

481 

AATGTTGCTGCAACAAAGGCACTTAAAGTTAAAGATGTTGCTACATTTGGTTTGAAGTCT 54 0 

+ + + + + 

55 161 NVAATKALKVKDVAT FGLKS 

180 
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541 

GGTGGAAGCGAAGATACTGGATATGTTGTTGAAATGAAAGCAGGAGCTGTAGAGGATAAG 600 
181 GGS E DTGYVVEMKAGAVEDK 

5 200 

601 

TATGGTAAAGTTGGAGATAGTACGGCAGGTATTGCAATAAATCTTCCTAGTACTGGACTT 660 
201 YGKVGDSTAGIAINLPSTGL 

10 220 

661 

G AAT AT GC AGG T AAAG G AAC AAC AAT T GAT T T T AAT AAAAC T T T AAAAG T T GAT G T AAC A 7 20 
221 EYAGKGTTIDFNKTLKVDVT 

15 240 

721 

GGTGGTTCAACACCTAGTGCTGTAGCTGTAAGTGGTTTTGTAACTAAAGATGATACTGAT 7 8 0 
241 GGSTPSAVAVSGFVTKDDTD 

20 260 

781 

T TAG C AAAAT C AGG T AC T AT AAAT G T AAGAG T T AT AAAT G C AAAAG AAG AAT C AAT T GAT 8 4 0 
261 LAKSGTINVRVI NAKEES I D 

25 280 

841 

ATAGATGCAAGCTCATATACATCAGCTGAAAATTTAGCTAAAAGATATGTATTTGATCCA 900 
281 I DAS SYTSAENLAKRYVFDP 

30 300 

901 

GAT G AAAT T T C T G AAG CAT AT AAG G C AAT AG T AGC AT T AC AAAAT GAT G G TAT AG AG T C T 960 
301 DEI SEAYKAIVALQNDGIES 

35 320 

961 

AAT T TAG T T C AG T TAG T T AAT GG AAAAT AT C AAG T GAT T T T T TAT C C AG AAG G T AAAAG A 1020 
+ + + + + 

321 NLVQLVNGKYQVI FYPEGKR 

40 340 

1021 

T T AGAAAC T AAAT C AG C AAA T G AT ACAAT AGC T AG TC AAGAT AC AC C AG C T AAAG T AGT T 1080 
+ + + + + 

341 LET KSANDTIASQDTPAKVV 

45 360 

♦ 

1081 

ATAAAAGCTAATAAATTAAAAGATTTAAAAGATTATGTAGATGATTTAAAAACATATAAT 114 0 

50 + + + + + 

361 IKANKLKDLKDYVDDLKTYN 

38 0 

1141 

AATACTTATTCAAATGTTGTAACAGTAGCAGGAGAAGATAGAATAGAAACTGCTATAGAA 12 00 



WO 02/062379 



55 



PCT/IE02/00017 



381 NTYSNVVTVAGEDRI ETAI E 

400 

1201 

TTAAGTAGTAAATATTATAATTCTGATGATAAAAATGCAATAACTGATAAAGCAGTTAAT 12 60 
401 LSSKYYNSDDKNAITDKAVN 

420 

1261 

GATATAGTATTAGTTGGATCTACATCTATAGTTGATGGTCTTGTTGCATCACCATTAGCT 1320 
421 DIVLVGSTSIVDGLVAS PLA 

440 

1321 

T CAGAAAAAACAGC T C CAT TAT TAT T AAC T T CAAAAG AT AAAT T AGAT T CAT C AG T AAAA 138 0 
441 SEKTAPLLLTSKDKLDS SVK 

460 

1381 

TCTGAAATAAAGAGAGTTATGAACTTAAAGAGTGACACTGGTATAAATACTTCTAAAAAA 14 4 0 
461 SEIKRVMNLKSDTGINTSKK 

480 

1441 

G T T TAT T T AGC T GG T G GAG T T AAT T C TAT AT C T AAAG AT GT AGAAAAT GAAT T G AAAAAC 1500 
481 VYLAGGVNS I SKDVENELKN 

500 

1501 

ATGGGTCTTAAAGTTACTAGATTATCAGGAGAAGACAGATACGAAACTTCTTTAGCAATA 1560 
501 MGLKVTRLSGEDRYETS LAI 

520 

1561 

GCTGATGAAATAGGTCTTGATAATGATAAAGCATTTGTAGTTGGTGGTACTGGATTAGCA 1620 
521 ADE IGLDNDKAFVVGGTGLA 

540 

1621 

GATGCTATGAGTATAGCTCCAGTTGCTTCTCAACTTAAAGATGGAGATGCTACTCCAATA 168 0 
541 DAMSIAPVASQLKDGDATPI 

560 



45 1681 

G TAG T T G TAG AT G G AAAAGC AAAAG AAAT AAG T GAT GAT GC T AAG AG T T T C T T AGGAAC T 17 4 0 
561 VVVDGKAKEI SDDAKSFLGT 

580 

50 1741 

T C T GAT G T T GAT AT AAT AG G T G G AAAAAAT AG C G TAT C T AAAG AG AT T G AAG AG T C AA T A 18 00 
+ + + + + 

581 SDVDIIGGKNSVSKEIEESI 

600 

55 1801 

GAT AG T G C AAC T G G AAAAAC T C C AG AT AG AAT AAG T GG AG AT GAT AG AC AAG C AAC T AAT 18 60 
+ + + + + 
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601 DSATGKTPDRISGDDRQATN 

620 

1861 

GCTGAAGTTTTAAAAGAAGATGATTATTTCACAGATGGTGAAGTTGTGAATTACTTTGTT 1920 
5 + + + + + 

621 AEVLKEDDY FTDGEVVNYFV 

640 

1921 

GCAAAAGATGGTTCTACTAAAGAAGATCAATTAGTAGATGCCTTAGCAGCAGCACCAATA 1980 

10 +- + + + + 

641 AKDGSTKEDQLVDALAAAP I 

660 

1981 

GCAGGTAGATTTAAGGAGTCTCCAGCTCCAATCATACTAGCTACTGATACTTTATCTTCT 204 0 

15 + + + + + 

661 AGRFKESPAPI ILATDTLS S 

680 

2041 

GACCAAAATGTAGCTGTAAGTAAAGCAGTTCCTAAAGATGGTGGAACTAACTTAGTTCAA 2100 

20 + + + + + 

681 DQNVAVSKAVPKDGGTNLVQ 

700 

2101 G T AG G T AAAG G TAT AG C T T C T T C AG T TAT AAAC AAAAT G AAAG AT T TAT TAG AT AT G 
2157 

25 + + + + + 

701 VGKGIASSVI NKMKDLLDM 

719 



30 
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Claims 

1. A vaccine for the treatment or prophylaxis of C difficile associated disease, 
the vaccine comprising a C. difficile gene or a C difficile peptide/polypeptide 
or a derivative or fragment or mutant or variant thereof which is 
immunogenic in humans. 



2. A vaccine for the treatment or prophylaxis of C. difficile associated disease, 
the vaccine comprising a C. difficile gene or C. difficile peptide/polypeptide 

10 or a derivative or fragment or mutant or variant thereof to which 

immunoreactivity is detected in individuals who have recovered from C. 
difficile infection. 

3. A vaccine as claimed in claim 1 or 2 wherein the gene encodes a C. difficile 
15 surface layer protein, Sip A or variant or homologue thereof. 

4. A vaccine as claimed in claim 1 or 2 wherein the peptide/polypeptide is a C. 
difficile surface layer protein, SlpA or variant or homologue thereof. 

20 5. A vaccine as claimed in any of claims 1 to 4 wherein the vaccine comprises a 

chimeric nucleic acid sequence. 

6. A vaccine as claimed in 5 wherein the chimeric nucleic acid sequence is 
derived from the 5' end of the gene, encoding the mature N-terminal moiety 

25 of SlpA from C. difficile. 

7. A vaccine as claimed in any of claims 1 to 4 wherein the vaccine comprises a 
chimeric peptide/polypeptide. 



30 8. 



A vaccine as claimed in 7 wherein the amino acid sequence of the chimeric 
peptide/polypeptide is derived from the mature N-terminal moiety of SlpA 
from C. difficile. 
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9. A vaccine as claimed in any of claims 1 to 8 wherein the vaccine contains an 
amino acid sequence SEQ ID No.l or a derivative or fragment or mutant or 
variant thereof. 

5 10. A vaccine as claimed in any of claims 1 to 8 wherein the vaccine contains an 

amino acid sequence SEQ ID No. 2 or a derivative or fragment or mutant or 
variant thereof. 

11. A vaccine as claimed in any of claims 1 to 8 wherein the vaccine contains a 
10 nucleotide sequence SEQ ID No.3 or a derivative or fragment or mutant or 

variant thereof. 

12. A vaccine as claimed in any of claims 1 to 8 wherein the vaccine contains a 
nucleotide sequence SEQ ID No.4 or a derivative or fragment or mutant or 

15 variant thereof. 

13. A vaccine as claimed in any of claims 1 to 8 wherein the vaccine contains a 
nucleotide sequence SEQ ID No. 5 or a derivative or fragment or mutant or 
variant thereof. 

20 

14. A vaccine as claimed in any of claims 1 to 8 wherein the vaccine contains a 
nucleotide sequence SEQ ID No.6 or a derivative or fragment or mutant or 
variant thereof. 

25 15. A vaccine as claimed in any of claims 1 to 8 wherein the vaccine contains a 

nucleotide sequence SEQ ID No. 7 or a derivative or fragment or mutant or 
variant thereof. 



16. 

30 



A vaccine as claimed in any of claims 1 to 8 wherein the vaccine contains a 
nucleotide sequence SEQ ID No.8 or a derivative or fragment or mutant or 
variant thereof. 
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17. A vaccine as claimed in any of claims 1 to 8 wherein the vaccine contains a 
nucleotide sequence SEQ ID No.9 or a derivative or fragment or mutant or 
variant thereof. 

5 18. A vaccine as claimed in any of claims 1 to 8 wherein the vaccine contains a 

nucleotide sequence SEQ ID No. 10 or a derivative or fragment or mutant or 
variant thereof. 

19. A vaccine as claimed in any preceding claim in combination with at least one 
10 other C. difficile sub-unit. 

20. A vaccine for the treatment or prophylaxis of C. difficile associated disease, 
the vaccine comprising the mature N-terminal moiety of a surface layer 
protein, SlpA of C. difficile or variant or homologue thereof which is 

15 immunogenic in humans. 

21. A vaccine as claimed in claim 20 wherein the N-terminal moiety of SlpA 
contains an amino acid sequence SEQ ID No. 1. 

20 22. A vaccine as claimed in claim 20 wherein the N-terminal moiety of SlpA 

contains an amino acid sequence SEQ ID No. 2. 

23. A vaccine for the treatment or prophylaxis of C. difficile associated disease, 
the vaccine comprising an immunodominant epitope derived from a C. 

25 difficile gene or a C. difficile peptide/polypeptide or a derivative or fragment 

or mutant or variant thereof which is immunogenic in humans. 

24. A vaccine as claimed in any preceding claim comprising a pharmaceutically 
acceptable carrier. 



30 



25. A vaccine as claimed in any preceding claim in combination with a 
pharmacologically suitable adjuvant. 
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26. A vaccine as claimed in claim 25 wherein the adjuvant is interleukin 12. 

27. A vaccine as claimed in claim 25 or 26 wherein the adjuvant is a heat shock 
protein. 

28. A vaccine as claimed in any preceding claim comprising at least one other 
pharmaceutical product. 

29. A vaccine as claimed in claim 28 wherein the pharmaceutical product is an 
antibiotic. 

30. A vaccine as claimed in claim 29 wherein the antibiotic is selected from one 
or more metronidazole, amoxycillin, tetracycline or erythromycin, 
clarithromycin or tinidazole. 

31. A vaccine as claimed in claim 28 wherein the pharmaceutical product 
comprises an acid-suppressing agent such as omeprazole or bismuth salts. 

32. A vaccine as claimed in any preceding claim in a form for oral 
administration. 

33. A vaccine as claimed in any preceding claim in a form for intranasal 
administration. 

34. A vaccine as claimed in any preceding claim in a form for intravenous 
administration. 

35. A vaccine as claimed in any preceding claim in a form for intramuscular 
administration. 

36. A vaccine as claimed in any of claims 1 to 35 including a peptide delivery 
system. 
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37. An immunodominant epitope derived from a C. difficile gene or a C. difficile 
peptide/polypeptide or a derivative or fragment or mutant or variant thereof. 

38. An immunodominant epitope as claimed in claim 37 wherein the C. difficile 
5 peptide/polypeptide contains an amino acid sequence SEQ ID No.l or SEQ 

ID No.2 or a derivative or fragment or mutant or variant thereof. 

39. An immunodominant epitope as claimed in claim 35 wherein the C. difficile 
peptide/polypeptide contains an amino acid sequence SEQ ID No. 3 or SEQ 

10 ID No.4 or SEQ ID No.5 or SEQ ID No.6 or SEQ ID No.7 or SEQ ID No.8 

or SEQ ID No. 9 or SEQ ID No. 10 or a derivative or fragment or mutant or 
variant thereof. 

40. A chimeric nucleic acid sequence derived from the 5' end of the slpA gene 
15 encoding the mature N-terminal moiety of SlpA from C. difficile which is 

immunogenic in humans. 

41. A chimeric peptide/polypeptide wherein the amino acid sequence of the 
chimeric peptide/polypeptide is derived from the mature N-terminal moiety 

20 of SlpA from C. difficile. 

42. A C. difficile peptide comprising SEQ ID No. 1. 

43. A C. difficile peptide comprising SEQ ID No. 2. 

25 

44. A C. difficile gene comprising SEQ ID No. 3. 

45. A C. difficile gene comprising SEQ ID No. 4. 



30 46. 



A C. difficile gene comprising SEQ ID No. 5. 
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10 



47. A C difficile gene comprising SEQ ID No. 6. 

48. A C. difficile gene comprising SEQ ID No. 7. 

49. A C. difficile gene comprising SEQ ID No. 8. 

50. A C. difficile gene comprising SEQ ID No. 9. 

51. AC. difficile gene comprising SEQ ID No. 10. 



52. The use of a C. difficile gene or a C. difficile peptide/polypeptide or a 
derivative or fragment or mutant or variant thereof which is immunogenic in 
humans in the preparation of a medicament for use in a method for the 
treatment or prophylaxis of C. difficile infection or C. difficile associated 

15 disease in a host. 

53. The use as claimed in claim 52 wherein the medicament which is prepared is 
a vaccine as claimed in any of claims 1 to 36. 

20 54. A method for preparing a vaccine for prophylaxis or treatment of C. difficile 

associated disease, the method comprising; 

obtaining a C. difficile gene or a C. difficile peptide/polypeptide or a 
derivative or fragment or mutant or variant thereof which is 
25 immunogenic in humans; and 

forming a vaccine preparation comprised of said gene or 
peptide/polypeptide or derivative or fragment or mutant or variant, 
which is suitable for administration to a host and which when 
30 administered raises an immune response. 
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55. A method as claimed in claim 54 wherein the C. difficile peptide/polypeptide 
contains an amino acid sequence SEQ ID No.l or SEQ ID No.2 or a 
derivative or fragment or mutant or variant thereof. 

5 56. A method as claimed in claim 54 wherein the C. difficile gene contains an 

amino acid sequence SEQ ID No.3 or SEQ ID No.4 or SEQ ID No.5 or SEQ 
ID No.6 or SEQ ID No.7 or SEQ ID No.8 or SEQ ID No.9 or SEQ ID No.10 
or a derivative or fragment or mutant or variant thereof. 

10 57. A method for prophylaxis or treatment of C. difficile associated disease, the 

method comprising; 

obtaining a C. difficile gene or a C. difficile peptide/polypeptide or a 
derivative or fragment or mutant or variant thereof which is 
15 immunogenic in humans; 

forming a vaccine preparation comprised of said gene or 
peptide/polypeptide or derivative or fragment or mutant or variant, 
and 

20 

administering the vaccine preparation to a host to raise an immune 
response. 

58. Monoclonal or polyclonal antibodies or fragments thereof, to a C. difficile 
25 peptide/polypeptide or a derivative or fragment or mutant or variant thereof 

which is immunogenic in humans. 

59. Monoclonal or polyclonal antibodies or fragments thereof, to C. difficile 
peptide/polypeptide or a derivative or fragment or mutant or variant thereof 

30 to which immunoreactivity is detected in individuals who have recovered 

from C. difficile infection. 
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60. Purified antibodies or serum obtained by immunisation of an animal with a 
vaccine according to any of claims 1 to 36. 

61. The use of the antibodies or fragments as claimed in claims 58 and 59 in the 
5 preparation of a medicament for treatment or prophylaxis of C. difficile 

infection or C. difficile associated disease. 

62. The use of the antibodies or serum as claimed in 60 in the preparation of a 
medicament for treatment or prophylaxis of C. difficile infection or C. 

10 difficile associated disease. 

63. The use of the antibodies or fragments or serum as claimed in any of claims 
58 to 60 for use in passive immunotherapy for established C. difficile 
infection. 



15 



20 



64. The use of the antibodies or fragment or serum as claimed in any of claims 58 
to 60 for the eradication of C. difficile associated disease. 

65. Use of interleukin 12 as an adjuvant in C. difficile vaccine. 

66. The use of humanised antibodies or serum for passive vaccination of an 
individual with C. difficile infection. 
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SEQUENCE LISTING 

<110> The Provost, Fellows & Scholars of t 
Trinity of Queen Elizabeth, near Dublin 

<120> C. difficile vaccine 

<130> TRI002/C/WO 

<160> 10 

<170> Patentln version 3.1 

<210> 1 
<211> 2157 
<212> DNA 

<213> Clostridium difficile 
<400> 1 

atgaataaga aaaatatagc aatagctatg tcaggtttaa 
cctgtttttg ctgcaactac tggaacacaa ggttatactg 
aaagcagtaa aacaattaca ggatggacta aaagataata 
tcttttaatg atggggttgt gggtgaagta gctcctaaaa 
agagatgctg cagctgagaa gttatataat cttgttaaca 
gatggagatt atgttgattt ttctgtagat tataatttag 
caagcagatg cagaagcaat tgttacaaag ttaaattcac 
gatatagcaa ctaaagatac ttttggaatg gttagtaaaa 
aatgttgctg caacaaaggc acttaaagtt aaagatgttg 
ggtggaagcg aagatactgg atatgttgtt gaaatgaaag 
tatggtaaag ttggagatag tacggcaggt attgcaataa 
gaatatgcag gtaaaggaac aacaattgat tttaataaaa 
ggtggttcaa cacctagtgc tgtagctgta agtggttttg 
ttagcaaaat caggtactat aaatgtaaga gttataaatg 
atagatgcaa gctcatatac atcagctgaa aatttagcta 
gatgaaattt ctgaagcata taaggcaata gtagcattac 
aatttagttc agttagttaa tggaaaatat caagtgattt 
ttagaaacta aatcagcaaa tgatacaata gctagtcaag 
ataaaagcta ataaattaaa agatttaaaa gattatgtag 
aatacttatt caaatgttgt aacagtagca ggagaagata 
ttaagtagta aatattataa ttctgatgat aaaaatgcaa 
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/ion 
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CdCddydCdy 


LgaaggLaaa 


/ion 
4 o U 


ctacatttgg 


tttgaagtct 


540 


caggagctgt 


agaggataag 


600 


atcttcctag 


tactggactt 


660 


ctttaaaagt 


tgatgtaaca 


720 


taactaaaga 


tgatactgat 


780 


caaaagaaga 


atcaattgat 


840 


aaagatatgt 


atttgatcca 


900 


aaaatgatgg 


tatagagtct 


960 


tttatccaga 


aggtaaaaga 


1020 


atacaccagc 


taaagtagtt 


1080 


atgatttaaa 


aacatataat 


1140 


gaatagaaac 


tgctatagaa 


1200 


taactgataa 


agcagttaat 


1260 



1380 
1440 
1500 
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gatatagtat tagttggatc tacatctata gttgatggtc ttgttgcatc accattagct 1320 
tcagaaaaaa cagctccatt attattaact tcaaaagata aattagattc atcagtaaaa 
tctgaaataa agagagttat gaacttaaag agtgacactg gtataaatac ttctaaaaaa 
gtttatttag ctggtggagt taattctata tctaaagatg tagaaaatga attgaaaaac 
atgggtctta aagttactag attatcagga gaagacagat acgaaacttc tttagcaata 1560 
gctgatgaaa taggtcttga taatgataaa gcatttgtag ttggtggtac tggattagca 1620 
gatgctatga gtatagctcc agttgcttct caacttaaag atggagatgc tactccaata 1680 
gtagttgtag atggaaaagc aaaagaaata agtgatgatg ctaagagttt cttaggaact 1740 
tctgatgttg atataatagg tggaaaaaat agcgtatcta aagagattga agagtcaata 
gatagtgcaa ctggaaaaac tccagataga ataagtggag atgatagaca agcaactaat 
gctgaagttt taaaagaaga tgattatttc acagatggtg aagttgtgaa ttactttgtt 1920 
gcaaaagatg gttctactaa agaagatcaa ttagtagatg ccttagcagc agcaccaata 1980 
gcaggtagat ttaaggagtc tccagctcca atcatactag ctactgatac tttatcttct 
gaccaaaatg tagctgtaag taaagcagtt cctaaagatg gtggaactaa cttagttcaa 
gtaggtaaag gtatagcttc ttcagttata aacaaaatga aagatttatt agatatg 

<210> 2 

<211> 1830 

<212> DNA 

<213> Clostridium difficle 



1800 
1860 



2040 
2100 
2157 



<400> 2 

atgaaaaaaa gaaatttagc aatggctatg gcagctgtta ctgtagtagg ttctgctgct 
ccagtttttg cagcagcttc agatgtaata tcactacaag atggtacaaa tgataagtat 
acagtatcaa atactaaagc tagtgactta gtaaaggata ttttagcagc acaaaactta 
acaacaggtg cagttatttt gaacaaagat acaaaagtta ctttctatga tgcaaatgag 240 
aaagattctt caactccaac tggagataaa aaagtttatt cagaacaaac tttaactaca 
gctaatggaa atgaagatta tgtaaagaca actttaaaaa atttagatgc aggagaatat 
gctattatag atttaactta taataatgct aaaactgttg aaattaaagt agtagcagct 
agtgaaaaaa cagtagttgt atctagtgat gcgaaaaata gtgcaaaaga tatagctgaa 
aaatatgtgt ttgaagacaa agacttagaa aatgcactaa aaactataaa tgcctcagat 
ttcagtaaaa ctgatagtta ctatcaagta gttctttatc caaaaggaaa gagattacaa 
ggtttctcaa cttatagagc tacaaattat aatgaaggaa ctgcatatgg taatacacca 
gtaatattaa ctctaaaatc tactagtaag agtaatttaa agactgcagt agaagagtta 720 



60 
120 
180 



300 
360 
420 
480 
540 
600 
660 



840 
900 
960 



1140 
1200 
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caaaaattga atgctagtta ttctaatact acaactttag ctggtgatga cagaatacaa 780 
acagctatag agataagtaa agaatattac aataatgatg gcgagaaatc agatcattca 
gctgatgtta aagagaatgt taaaaatgtt gtattagtag gtgcaaatgc actagtagat 
ggattagttg cggctccttt agcagcagaa aaagatgctc cactattatt aacttcaaaa 
gataaattag attcgtcagt aaaatctgaa ataaagagag ttttagactt aaaaacttca 1020 
acagaagtaa caggaaaaac agtttatata gctggtggag ttaatagtgt atctaaagaa 1080 
gttgtaacag aattagaatc aatgggatta aaagttgaaa gattctcagg tgatgataga 
tatgaaactt ctttaaaaat agcaggtgaa ataggcttag ataatgataa ggcttatgta 
gttggtggaa caggattagc agatgccatg agtatagctt cagttgcttc tactaaatta 1260 
gatggtaatg gtgttgtaga tagaacaaat ggacatgcta ctccaatagt tgttgtagat 1320 
ggaaaagctg ataaaatatc tgatgactta gatagtttct taggaagcgc tgatgtagat 
ataataggtg gatttgcaag tgtatctgaa aagatggaag aagctatatc agatgctact 
ggtaaaggcg ttacaagagt taaaggcgac gatagacaag acactaactc tgaagttata 
aaaacatatt atgctaatga tactgaaata gctaaagctg cagttttaga taaagattca 
ggtgcttcaa gtagtgatgc aggagtattt aatttctatg tagctaaaga tggatctaca 
aaagaagatc aattagttga tgcattagca gtaggagctg ttgctggata taaacttgct 
ccagttgtat tagctactga ttctttatct tctgatcaat cggttgctat aagcaaagtt 1740 
gtaggagaaa aatattctaa agatttaaca caagttggtc aaggaatagc taattcagtt 
ataaacaaaa tgaaagattt attagatatg 

<210> 3 
<211> 2158 
<212> DNA 

<213> Clostridium difficile 
<400> 3 

atgaataaga aaaatatagc aatagctatg tcaggtttaa cagttttagc ttcggctgct 
cctgtttttg ctgcaactac tggaacacaa ggttatactg tagttaaaaa cgactggaaa 
aaagcagtaa aacaattaca agatggacta aaagataata gtataggaaa gataactgta 



1380 
1440 
1500 
1560 
1620 
1680 



1800 
1830 



60 
120 
180 



tcttttaatg atggggttgt gggtgaagta gctcctaaaa gtgctaataa gaaagcggac 240 
agagatgctg cagctgagaa gttatataat cttgttaaca ctcaattaga taaattaggt 
gatggagatt atgttgattt ttctgtagat tataatttag aaaacaaaat aataactaat 
caagcagatg cagaagcaat tgttacaaag ttaaattcac ttaatgagaa aactcttatt 420 



300 
360 



gatatagcaa ctaaagatac ttttggaatg gttagtaaaa cacaagatag tgaaggtaaa 



480 
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aatgttgctg 


caacaaaggc 


acttaaagtt 


aaagatgttg 


ctacatttgg 


tttgaagtct 


540 


ggtggaagcg 


aagatactgg 


atatgttgtt 


gaaatgaaag 


caggagctgt 


agaggataag 


600 


tatggtaaag 


ttggagatag 


tacggcaggt 


attgcaataa 


atcttcctag 


tactggactt 


660 


gaatatgcag 


gtaaaggaac 


aacaattgat 


tttaataaaa 


ctttaaaagt 


tgatgtaaca 


720 


ggtggttcaa 


cacctagtgc 


tgtagctgta 


agtggttttg 


taactaaaga 


tgatactgat 


780 


ttagcaaaat 


caggtactat 


aaatgtaaga 


gttataaatg 


caaaagaaga 


atcaattgat 


840 


atagatgcaa 


gctcatatac 


atcagctgaa 


aatttagcta 


aaagatatgt 


atttgatcca 
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gatgaaattt 


ctgaagcata 


taaggcaata 


gtagcattac 


aaaatgatgg 


tatagagtct 


960 


aacttagttc 


agttagttaa 


tggaaaatat 


caagtgattt 


tttatccaga 


aggtaaaaga 


1020 


ttagaaacta 


aatcagcaaa 


tgatacaata 


gctagtcaag 


atacaccagc 


taaagtagtt 


1080 


ataaaagcta 


ataaattaaa 


agatttaaaa 


gattatgtag 


atgatttaaa 


aacatataat 


1140 


aatacttatt 


caaatgttgt 


aacagtagca 


ggagaagata 


gaatagaaac 


tgctatagaa 


1200 


ttaagtagta 


aatattataa 


ttctgatgat 


aaaaatgcaa 


taactgataa 


agcagttaat 


1260 


gatatagtat 


tagttggatc 


tacatctata 


gttgatggtc 


ttgttgcatc 


accattagct 


1320 


tcagaaaaaa 


cagctccatt 


attattaact 


tcaaaagata 


aattagattc 


atcagtaaaa 


1380 


tctgaaataa 


agagagttat 


gaacttaaag 


agtgacactg 


gtataaatac 


ttctaaaaaa 


1440 


gtttatttag 


ctggtggagt 


taattctata 


tctaaagatg 


tagaaaatga 


attgaaaaac 


1500 


atgggtctta 


aagttactag 


attatcagga 


gaagacagat 


acgaaacttc 


tttagcaata 


1560 


gctgatgaaa 


taggtcttga 


taatgataaa 


gcatttgtag 


ttggtggtac 


tggattagca 


1620 


gatgctatga 


gtatagctcc 


agttgcttct 


caacttaaag 


atggagatgc 


tactccaata 


1680 


gtagttgtag 


atggaaaagc 


aaaagaaata 


agtgatgatg 


ctaagagttt 


cttaggaact 


1740 


tctgatgttg 


atataatagg 


tggaaaaaat 


agcgtatcta 


aagagattga 


agagtcaata 


1800 


gatagtgcaa 


ctggaaaaac 


tccagataga 


ataagtggag 


atgatagaca 


agcaactaat 


1860 


gctgaagttt 


taaaagaaga 


tgattatttc 


acagatggtg 


aagttgtgaa 


ttactttgtt 


1920 


gcaaaagatg 


gttctactaa 


agaagatcaa 


ttagtagatg 


ccttagcagc 


agcaccaata 


1980 


gcaggtagat 


ttaaggagtc 


tccagctcca 


atcatactag 


ctactgatac 


tttatcttct 


2040 


gaccaaaatg 


tagctgtaag 


taaagcagtt 


cctaaagatg 


gtggaactaa 


cttagttcaa 


2100 


gtaggtaaag 


gtatagcttc 


ttcagttata 


aacaaaatga 


aagatttatt 


agatatgg 


2158 



<210> 4 

<211> 2271 

<212> DNA 

<213> Clostridium difficile 



WO 02/062379 PCT/IE02/00017 

5/11 



<400> 4 



atgaataaga 


aaaatatagc 


aatagctatg 


tcaggtttaa 


cagttttagc 


ttcggctgca 


60 


cctgtatttg 


cagatgatac 


aaaagttgaa 


actggtgatc 


aaggatatac 


agtggtacaa 


120 


agcaagtata 


agaaagctgt 


tgaacaatta 


caaaaaggaa 


tattagatgg 


aagtataaca 


180 


gaaattaaag 


ttttctttga 


gggaacttta 


gcatctacta 


taaaagtagg 


ttctgagctt 


240 


aatgcagcag 


atgcaagtaa 


attattgttt 


acacaagtag 


ataataaact 


agataattta 


300 


ggtgatggag 


attatgtaga 


tttcttaata 


acttctccag 


gtcaagggga 


taaaataact 


360 


acaagtaaac 


ttgttgcatt 


gaaagattta 


acaggtgctt 


cagcagatgc 


tataattgct 


420 


ggaacatctt 


cagcagatgg 


tgttgttaca 


aatactggag 


ctgctagtgg 


ttctactgag 


480 


acaaattcag 


caggaacaaa 


acttgcaatg 


tcagctattt 


ttgacacagc 


atatacagat 


540 


tcatctgaaa 


ctgcggttaa 


gattactata 


aaagcagata 


tgaatgatac 


taaatttggt 


600 


aaagcaggtg 


agacaactta 


ttcaactggg 


cttacatttg 


aagatgggtc 


tacagaaaaa 


660 


attgttaaat 


taggggacag 


tgatattata 


gatataacta 


aagctcttaa 


acttactgtt 


720 


gttcctggaa 


gtaaagcaac 


tgttaagttt 


gctgaaaaaa 


caccaagtgc 


cagtgttcaa 


780 


ccagtaataa 


caaagcttag 


aataataaat 


gctaaagaag 


aaacaataga 


tattgacgct 


840 


agttctagta 


aaacagcaca 


agatttagct 


aaaaaatatg 


tatttaataa 


aactgattta 


900 


aatactcttt 


ataaagtatt 


aaatggagat 


gaagcagata 


ctaatggatt 


aatagaagaa 


960 


gttagtggaa 


aatatcaagt 


agttctttat 


ccagaaggaa 


aaagagttac 


aactaagagt 


1020 


gctgcaaagg 


cttcaattgc 


tgatgaaaat 


tcaccagtta 


aattaactct 


taagtcagat 


1080 


aagaagaaag 


acttaaaaga 


ttatgtggat 


gatttaagaa 


catataataa 


tggatattca 


1140 


aatgctatag 


aagtagcagg 


agaagataga 


atagaaactg 


caatagcatt 


aagtcaaaaa 


1200 


tattataact 


ctgatgatga 


aaatgctata 


tttagagatt 


cagttgataa 


tgtagtattg 


1260 


gttggaggaa 


atgcaatagt 


tgatggactt 


gtagcttctc 


ctttagcttc 


tgaaaagaaa 


1320 


gctcctttat 


tattaacttc 


aaaagataaa 


ttagattcaa 


gcgtaaaagc 


tgaaataaag 


1380 


agagttatga 


atataaagag 


tacaacaggt 


ataaatactt 


caaagaaagt 


ttatttagct 


1440 


ggtggagtta 


attctatatc 


taaagaagta 


gaaaatgaat 


taaaagatat 


gggacttaaa 


1500 


gttacaagat 


tagcaggaga 


tgatagatat 


gaaacttctc 


taaaaatagc 


tgatgaagta 


1560 


ggtcttgata 


atgataaagc 


atttgtagtt 


ggaggaacag 


gattagcaga 


tgccatgagt 


1620 


atagctccag 


ttgcatctca 


attaagaaat 


gctaatggta 


aaatggattt 


agctgatggt 


1680 


gatgctacac 


caatagtagt 


tgtagatgga 


aaagctaaaa 


ctataaatga 


tgatgtaaaa 


1740 


gatttcttag 


atgattcaca 


agttgatata 


ataggtggag 


aaaacagtgt 


atctaaagat 


1800 
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gttgaaaatg caatagatga tgctacaggt 
agacaagcaa ctaatgcaaa agttataaaa 
aatgataaaa aagtagttaa tttctttgta 
ttagttgatg ctttagcagc agctccagtt 
gatggtaagc cagtagataa agatggtaaa 
aaattagtat ctccagcacc tatagtatta 
gtatctataa gtaaagttct tgataaagat 
ggtatagcta cttcagttat aaacaaaatg 

<210> 5 
<211> 2158 
<212> DNA 

<213> Clostridium difficile 
<400> 5 

atgaataaga aaaatatagc aatagctatg 
cctgtttttg ctgcaactac tggaacacaa 
aaagcagtaa aacaattaca agatggacta 
tcttttaatg atggggttgt gggtgaagta 
agagatgctg cagctgagaa gttatataat 
gatggagatt atgttgattt ttctgtagat 
caagcagatg cagaagcaat tgttacaaag 
gatatagcaa ctaaagatac ttttggaatg 
aatgttgctg caacaaaggc acttaaagtt 
ggtggaagcg aagatactgg atatgttgtt 
tatggtaaag ttggagatag tacggcaggt 
gaatatgcag gtaaaggaac aacaattgat 
ggtggttcaa cacctagtgc tgtagctgta 
ttagcaaaat caggtactat aaatgtaaga 
atagatgcaa gctcatatac atcagctgaa 
gatgaaattt ctgaagcata taaggcaata 
aatttagttc agttagttaa tggaaaatat 
ttagaaacta aatcagcaaa tgatacaata 
ataaaagcta ataaattaaa agatttaaaa 
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aaatctccag atagatatag tggagatgat 1860 

gaatcttctt attatcaaga taacttaaat 1920 

gctaaagatg gttctactaa agaagatcaa 1980 

gcagcaaact ttggtgtaac tcttaattct 2040 

gtattaactg gttctgataa tgataaaaat 2100 

gctactgatt ctttatcttc agatcaaagt 2160 

aatggagaaa acttagttca agttggtaaa 2220 

aaagatttat tagatatgta a 2271 



tcaggtttaa cagttttagc ttcggctgct 60 

ggttatactg tagttaaaaa cgactggaaa 120 

aaagataata gtataggaaa gataactgta 180 

gctcctaaaa gtgctaataa gaaagcggac 240 

cttgttaaca ctcaattaga taaattaggt 300 

tataatttag aaaacaaaat aataactaat 360 

ttaaattcac ttaatgagaa aactcttatt 420 

gttagtaaaa cacaagatag tggaggtaaa 480 

aaagatgttg ctacatttgg tttgaagtct 540 

gaaatgaaag caggagctgt agaggataag 600 

attgcaataa atcttcctag tactggactt 660 

tttaataaaa ctttaaaagt tgatgtaaca 720 

agtggttttg taactaaaga tgatactgat 780 

gttataaatg caaaagaaga atcaattgat 840 

aatttagcta aaagatatgt atttgatcca 900 

gtagcattac aaaatgatgg tatagagtct 960 

caagtgattt tttatccaga aggtaaaaga 1020 

gctagtcaag atacaccagc taaagtagtt 1080 

gattatgtag atgatttaaa aacatataat 1140 
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aatacttatt 


caaatgttgt 


aacagtagca 


ggagaagata 


gaatagaaac 


tgctatagaa 


1200 


ttaagtagta 


aatattataa 


ttctgatgat 


aaaaatgcaa 


taactgataa 


agcagttaat 


1260 


gatatagtat 


tagttggatc 


tacatctata 


gttgatggtc 


ttgttgcatc 


accattagct 


1320 


tcagaaaaaa 


cagctccatt 


attattagct 


tcaaaagata 


aattagattc 


atcagtaaaa 


1380 


tctgaaataa 


agagagttat 


gaacttaaag 


agtgacactg 


gtataaatac 


ttctaaaaaa 


1440 


gtttatttag 


ctggtggagt 


taattctata 


tctaaagatg 


tagaaaatga 


attgaaaaac 


1500 


atgggtctta 


aagttactag 


attatcagga 


gaagacagat 


acgaaacttc 


tttagcaata 


1560 


gctgatgaaa 


taggtcttga 


taatgataaa 


gcatttgtag 


ttggtggtac 


tggattagca 


1620 


gatgctatga 


gtatagctcc 


agttgcttct 


caacttaaag 


atggagatgc 


tactccaata 


1680 


gtagttgtag 


atggaaaagc 


aaaagaaata 


agtgatgatg 


ctaagagttt 


cttaggaact 


1740 


tctgatgttg 


atataatagg 


tggaaaaaat 


agcgtatcta 


aagagattga 


agagtcaata 


1800 


gatagtgcaa 


ctggaaaaac 


tccagataga 


ataagtggag 


atgatagaca 


agcaactaat 


1860 


gctgaagttt 


taaaagaaga 


tgattatttc 


acagatggtg 


aagttgtgaa 


ttactttgtt 


1920 


gcaaaagatg 


gttctactaa 


agaagatcaa 


ttagtagatg 


ccttagcagc 


agcaccaata 


1980 


gcaggtagat 


ttaaggagtc 


tccagctcca 


atcatactag 


ctactgatac 


tttatcttct 


2040 


gaccaaaatg 


tagctgtaag 


taaagcagtt 


cctaaagatg 


gtggaactaa 


cttagttcaa 


2100 


gtaggtaaag 


gtatagcttc 


ttcagttata 


aacaaaatga 


aagatttatt 


agatatgg 


2158 


<210> 6 
<211> 2217 
<212> DNA 

<213> Clostridium difficile 










<400> 6 
atgaataaga 


aggatatagc 


aatagctatg 


tcaggattaa 


cagtattagc 


ttctgcagca 


60 


cctgtatttg 


ctgctagtag 


ttttacagca 


gattataatt 


atactgtagt 


gcaaggaaaa 


120 


tatcaaaaag 


ttataactgg 


attacaagat 


ggtttaaaaa 


atggaaaaat 


aacaaatatt 


180 


gatgtaatat 


ttgatggaag 


ttcaattggt 


gaggtagtgc 


caggttctga 


tgctgcagct 


2 4 0 


gcagctacta 


aattaaaaag 


tttagttgat 


gataagttag 


ataacttagg 


tgatggaaaa 


300 


tacgttcaat 


ttaatgttac 


ttatactact 


aaatctataa 


taactaaagc 


agaattaaaa 


n f f\ 

3 60 


aattattata 


atcaattaga 


aagtagtaaa 


gatagaatac 


ttataggaaa 


tgaacctcaa 


420 


gatacaggaa 


ctaaaggtct 


tataaaagct 


gatactgatg 


gtactactgc 


tgttgcagca 


480 


gctgcaccat 


tgaaattatc 


agatatattt 


acgtttagtt 


atgatgaagt 


aacaggtgta 


540 


cttaaagcag 


aaccaacaag 


taaagtaagc 


gctggtaaag 


ttcaaggtct 


aaaatatgga 


600 
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aatacaggag 


caactaacta 


tacttctgga 


get gaaatat 


4— 4- 4— /~» z'"' 4~~ a 

Cugt icctdu 


LaLdy y L- L La 


660 


acattaactg 


ctgatacaac 


tgcaacaaca 


gat gtaaat a 




+- a ft a 4- r-% f* aa 

u d l y a y Ly La 


720 


tttaaattta 


atggtactga 


tacgattagt 


ggattcccag 




d y L L LLLdLL 


7 8 0 


cttagagcaa 


gtataaaagt 


aataaatgca 


aaagaagaat 




Ly a l l L-aay l 


8 4 0 


tcacatagaa 


cagctgaaga 


tttagctgaa 


aaatatgtat 




aLja Ly Ly aa l 


900 


aaaacttatg 


aggcactgac 


tgatttatat 


aaagaaggta 


t aacaagt aa 


4- /■> 4- 4- ^ 4~ /~« 4~ 
LCLLaLCaCL 




caagatggtg 


gaaaatatca 


agttgtttta 


ttt gctcaag 


gaaagagatt 


ddLLdL L a ct a 


1020 


ggagcaactg 


gaactttagc 


agatgaaaat 


tctcctctta 


aagL.aaodci l 


ddaay uay a l. 


108 0 


aaagtaaaag 


acttaaaaga 


ttatgttgaa 


gattt aaaaa 


atgCtdaCdd 


Ly y d La L LUa 


114 0 


aattctgttg 


ttgtagcagg 


tgaagataga 


at agaaacag 


caatagagLi 


day Lay L-dda 


1200 


tactataact 


ctgatgatga 


caatgcaata 


act aaagat c 


Cay ILdaCaa 


Ly LLyLLLLa 


12 60 


gttggttctc 


aagctgtagt 


tgatgggctt 


gtagcttcac 


y 1 4- 4— 4— /-Sf /— 4 — i 4— 

ClLLdgCdLC 


Lyaaaaaaya 


1320 


gctcctttac 


tattaacttc 


agcaggaaaa 


L.LagauL.caa 


y Ly Laaay u 


i^yay aa a 


1380 


agagtaatgg 


atttaaaatc 


tacaacaggt 


gt aaatact t 


ctaaaaaagi- 


L LdL- L Ldy L. L 


14 40 


ggtggagtaa 


actctatatc 


taaagatgta 


gaaaat gaat 


4- ~i ~i ~~i /-T -a 4— -a 4- 
LdadagdLdL 


yyydUL l a a a 


1500 


gttacaagat 


tatcaggaga 


tgatagatat 


gaaacttctt 


IdgCLatdgL 


Ly d l y a a a l a 


1560 


ggtcttgata 


atgataaagc 


ttttgtagtt 


ggaggaacag 


✓^r — t 4~~ 4~ £s t — T ^ /T 3 

gatuageggd 


LyLLdLyay L 


1620 


atagctccag 


ttgcttctca 


attaagaaac 


tcaaauggdy 




^ ^^^crcrtaat 

a a a a y y zJ 


1680 


gcaactccaa 


tagtagttgt 


tgatggaaaa 


gctaaagata 


LaaaLLL-Lya 


ay i_aaaaya »— 


1740 


ttcttagatg 


attcacaagt 


tgatataata 


ggtggtgt a a 




i_ a a ci y a ci v4 i— ca 


1800 


atggaagcaa 


tagatgatgc 


tactggaaaa 


t cacctgaga 


gauaiagi-gg 


ay day a Lay a 


1860 


caagcaacaa 


atgctaaagt 


tataaaagaa 


gatgattt ct 


ttaaaaatgg 


^/TaaairT"} _ 't"i4f^'Pi 
dy ddy L LdLa 


1920 


aacttctttg 


tagctaaaga 


tggttcaact 


aaagaagat c 


—i — i +- 4- /t 4— n yr a^ 

aattagtaga 


4- /-• 4- 4- 4- anPfl 

LyL-L l l a y ^ a 


1980 


ggtgctgcaa 


ttgctggtaa 


ctttggtgta 


acagtagata 


atgaaggaaa 


acctacagtt 


9 H A Pi 
Z U *i U 


gctgataaaa 


aagcttctcc 


agcaccaatt 


gttttagcaa 


cagattcttt 


atcttctgat 


2100 


caaaatgtag 


ctataagtaa 


agctgtaaat 


gatgaegcta 


atactaagaa 


tctagttcaa 


2160 


gttggtaaag 


gtatagctac 


ttcagttgta 


agtaaaataa 


aagatttatt 


agatatg 


2217 



<210> 7 

<211> 2145 

<212> DNA 

<213> Clostridium difficile 
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<400> 7 



atgaataaga 


aaaacttagc 


aatggctatg 


gcagcagt ta 


ctgttgtggg 


4— 4— /—i +- c~r i— _1 /~T <r 

LLCugcaycg 


fin 


ccaatatttg 


cagatagtac 


tacgccaggt 


t a tact g tag 


ugaaaaauga 


ff niiaaaaaa 
L Lyydddddd 


j_ /_ w 


gcagtaaaac 


aattacaaga 


tgggttgaaa 


aat aaaact a 


tatcaacaat 


dddyy l y ll-l 


1 fid 
j. 0 \j 


tttaatggaa 


actctgttgg 


agaagttaca 


ccagccagtt 


ctggagcaaa 


aaaagcagat 




agagatgctg 


cagctgaaaa 


gttatataat 


tt agt aaat a 


cacaattaga 


4- n 2 a i" o /T /t 4— 
LdddLLdgy L 


O VJ VJ 


gatggagatt 


acgttgactt 


tgaagtaact 


t at aattt ag 


^--i 4* f-* 4^ /-*- ___ *H 

CtaCtCaaaL 


d d L LaLdadd 


3 60 


gcagaagcag 


aggcagttct 


tacaaaatt a 


caacaat at a 


—I 4— "__ 4~ _3 "_1 _3 r*T \" 

d LyaLaaay l 


dL U Ld Uddd L. 


420 


tctgcaacag 


atacagtaaa 


aggtatggta 


tctgatacac 


aaguT-gauag 


r - a a a a a "I - fr+* +" 
Ldadad Ly L L 


4 r n 

T O W 


gcagctaacc 


cacttaaagt 


tagtgatatg 


t at acaatac 


cauCL.gci.aL 


4- 3 rfrtAa a n +■ 
Ldu uyy day l 


54 0 


gatgattctg 


ggtatagtat 


tgctaaacca 


acagaaaaga 


ctacaagttt 


—.4—4- +~ 13 "H /"T "f~ 

dtty Ldi,gy l 


fin o 

U L> L 1 


acggttggtg 


atgcaactgc 


aggtaaagca 


ataacagtag 


atacagcttc 


dddLyddgC _ 


fifin 

vj O VJ 


tttgctggaa 


atggaaaggt 


tattgactac 


aat aaatcat 


tcaaagcaac 


LgLdLddyy d 


7 2 0 


gatggaacag 


ttaagacaag 


cggggtt gt a 


✓—i 4— -7t -__ ^ /^x ___ 4~~ /*t 

CLtdaaydLy 


L-dcty L.y a L-ci L. 


yyv_/ l y L-a a v.- a 


780 


ggtactataa 


aagttagagt 


tacaagtgca 


aaagaagaat 


ctattgatgt 


yydL Ltddy L 


8 4 0 


tcatatatta 


gtgctgaaaa 


tttagctaaa 


aaatat gt at 


4— 4— — >— i-f-^— <— i+- ^i_> 

LLdaLCCLad 


d y d y y l l ll-l. 


900 


gaagcttata 


atgcaatagt 


tgcattacaa 


aat gat ggaa 


LdgadtCLCJd 


L L Ldy _.dt—dd 


960 


ttagttaatg 


gaaaatatca 


. 1_J__.1_J_.I_4_-, 

agttattttc 


tatccagaag 


gaaaaayd l l 


3ffaaar1"3aa 
dy uudL. L a u a 


1020 


tctgcagata 


taatagctga 


tgcagatagt 


ccagctaaaa 


LddLLdLaaa 


dyL-i—dci logo 


1080 


ttaaaagatt 


taaaagatta 


tgtagat gat 


ttaaaaacat 


o /-»_»_» "t" _i _l "I" iS /" * 

dtda Lda Lau 


U L Q L. L V u U U L 


1140 


gttgtaacag 


tagcaggaga 


agatagaata 


gaaactgcta 


tagaattaag 


+-3/^fl"a3ai"a+" 
Ldy LdddLd L 


12 00 


tataattctg 


atgataaaaa 


tgcaat aact 


gatgatgcag 


LLddLdd Ld L 


dytflLLGyLL 


1260 


ggatctacat 


ctatagttga 


tggtcttgtt 


gcatcaccat 


LagcLLcaga 


dadaaLay - -. 


1320 


ccattattat 


taacttcaaa 


agataaatta 


gattcatcag 


Ldddd LLiyd 


y Q L.Q go way cj. 


1380 


gttatgaact 


taaagagtga 


tactggtata 


aat actt ct a 


aaaaagu l ud 


L l. L-dyouyy <— 


14 4 0 


ggagttaatt 


ctatatctaa 


agatgtagaa 


gatgaattga 


aaaatatggg 


CCt Ldddy L L. 


150 0 


actagattat 


caggagaaga 


cagatacgaa 


i. j_ j_ +_ j_ 

acttctttag 


caaLagcuga 


Ly add Ldyy i_ 


1560 


cttgataatg 


ataaagcatt 


tgtagttggt 


ggtactggat 


tggcagatgc 


tat gagtata 


± D __ U 


gctccagttg 


cttctcaact 


taaagatgga 


gatgctactc 


caatagtagt 


tgtagatgga 


1680 


aaagcaaaag 


aaataagtga 


tgatgctaag 


agtttcttag 


gaacttctga 


tgttgatata 


1740 


ataggtggaa 


aaaatagcgt 


atctaaagag 


attgaagagt 


caatagatag 


tgcaactgga 


1800 
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aaaactccag atagaataag tggagatgac 
gaagatgatt atttcaaaga tggtgaagtt 
actaaagaag atcaattagt agatgcatta 
gagtctccag ctccaatcat actagctact 
gtaagtaaag cagttcctaa agatggtgga 
gcttcttcag ttataaacaa aatgaaagat 

<210> 8 
<211> 2158 
<212> DNA 

<213> Clostridium difficile 
<400> 8 

atgaataaga aaaatatagc aatagctatg 
cctgtttttg ctgcaactac tggaacacaa 
aaagcagtaa aacaattaca agatggacta 
tcttttaatg atggggttgt gggtgaagta 
agagatgctg cagctgagaa gttatataat 
gatggagatt atgttgattt ttctgtagat 
caagcagatg cagaagcaat tgttacaaag 
gatatagcaa ctaaagatac ttttggaatg 
aatgttgctg caacaaaggc acttaaagtt 
ggtggaagcg aagatactgg atatgttatt 
tatggtaaag ttggagatag tacggcaggt 
gaatatgcag gtaaaggaac aacaattgat 
ggtggttcaa cacctagtgc tgtagctgta 
ttagcaaaat caggtactat aaatgtaaga 
atagatgcaa gctcatatac atcagctgaa 
gatgaaattt ctgaagcata taaggcaata 
aatttagttc agttagttaa tggaaaatat 
ttagaaacta aatcagcaaa tgatacaata 
ataaaagcta ataaattaaa agatttaaaa 
aatacttatt caaatgttgt aacagtagca 
ttaagtagta aatattataa ttctgatgat 
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agacaagcaa ctaatgctga agttttaaaa 1860 

gtgaattact ttgttgcaaa agatggttct 1920 

gcagcagcac caatagcagg tagatttaag 1980 

gatactttat cttctgacca aaatgtagct 2040 

actaacttag ttcaagtagg taaaggtata 2100 

ttattagata tgtaa 2145 



tcaggtttaa 


/~» /-r -4- 4- +■ +■ a /t 

Cdyt ittayc 




60 


ggtt a tact g 


4— — * /-* 4- O —J —3 -"1 

LdyLLddddd 




120 


aaagat aata 


gtataggaaa 


ydLdaCty Ld 


1 on 
± 0 u 


gctcctaaaa 


gtgct aat aa 


gaaagcyya.c 


9 AO 

*i \J 


ct tgtt aaca 


ctcaattaga 


4— — i 4— 4— s~r t~r 4— 

L.ciclclLL.a.yy L. 


300 


tataatttag 


aaaaaaaaat 


^-~ s 4-—\--\/— <4— — l— j4— 


3 60 


ttaaattcac 


ttaatgagaa 


daCtC/LLdL L 


4 20 


gttagtaaaa 


cacaagatag 


4— /-r -~i — j n /~r +■ -i "1 -3 

Lyaayy Lddd 


d R o 


aaagatgttg 


ctacatttgg 


tttgaagtct 


540 


gaaatgaaag 


caggagctgt 


agaggataag 


600 


attgcaataa 


atcttcctag 


tactggactt 


660 


tttaataaaa 


ctttaaaagt 


tgatgtaaca 


720 


agtggttttg 


taactaaaga 


tgatactgat 


780 


gttataaatg 


caaaagaaga 


atcaattgat 


840 


aatttagcta 


aaagacatgt 


atttgatcca 


900 


gtagcattac 


aaaatgatgg 


tatagagtct 


960 


caagtgattt 


tttatccaga 


aggtaaaaga 


1020 


gctagtcaag 


atacaccagc 


taaagtagtt 


1080 


gattatgtag 


atgatttaaa 


aacatataat 


1140 


ggagaagata 


gaatagaaac 


tgctatagaa 


1200 


aaaaatgcaa 


taactgataa 


agcagttaat 


1260 



WO 02/062379 PCT/IE02/00017 

11/11 

gatatagtat tagttggatc tacatctata gttgatggtc ttgttgcatc accattagct 1320 

tcagaaaaaa cagctccatt attattaact tcaaaagata aattagattc atcagtaaaa 1380 

tctgaaataa agagagttat gaacttaaag agtgacactg gtataaatac ttctaaaaaa 1440 

gtttatttag ctggtggagt taattctata tctaaagatg tagaaaatga attgaaaaac 1500 

atgggtctta aagttactag attatcagga gaagacagat acgaaacttc tttagcaata 1560 

gctgatgaaa taggtcttga taatgataaa gcatttgtag ttggtggtac tggattagca 1620 

gatgctatga gtatagctcc agttgcttct caacttaaag atggagatgc tactccaata 1680 

gtagttgtag atggaaaagc aaaagaaata agtgatgatg ctaagagttt cttaggaact 1740 
tctgatgttg atataatagg tggaaaaaat agcgtatcta aagagattga agagtcaata 
gatagtgcaa ctggaaaaac tccagataga ataagtggag atgatagaca agcaactaat 

gctgaagttt taaaagaaga tgattatttc acagatggtg aagttgtgaa ttactttgtt 1920 



1800 
1860 



1980 
2040 



gcaaaagatg gttctactaa agaagatcaa ttagtagatg ccttagcagc agcaccaata 

gcaggtagat ttaaggagtc tccagctcca atcatactag ctactgatac tttatcttct 

gaccaaaatg tagctgtaag taaagcagtt cctaaagatg gtggaactaa cttagttcaa 2100 

gtaggtaaag gtatagcttc ttcagttata aacaaaatga aagatttatt agatatga 2158 

<210> 9 
<211> 20 
<212> PRT 

<213> Clostridium difficile 
<400> 9 

Asp Lys Thr Lys Val Glu Thr Ala Asp Gin Gly Tyr Thr Val Val Gin 
15 10 15 



Ser Lys Tyr Lys 
20 



<210> 10 

<211> 20 

<212> PRT 

<213> Clostridium difficile 

<400> 10 

Ala Thr Thr Gly Thr Gin Gly Tyr Thr Val Val Lys Asn Asp Gly Lys 
1 5 10 15 



Lys Ala Val Lys 
20 
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SEQ ID No. 1 
(Strain 171500) 

DKTKVETADQGYTVVQSKYK 

SEQ ID No. 2 
(Strain 170324) 

ATTTGTQGYTVVKNDGKKAVK 



SEQ ID No. 3 
(Strain 171500 DNA) 

ATGAATAAGAAAAATATAGCAATAGCTATGTCAGGTTTAACAGTTTTAGCTTCGGCTGCACCT 

GTATTTGCAGATGATACAAAAGTTGAAACTGGTGATCAAGGATATACAGTGGTACAAAGCAA 

GTATAAGAAAGCTGTTGAACAATTACAAAAAGGAATATTAGATGGAAGTATAACAGAAATTA 

AAGTTTTCTTTGAGGGAACTTTAGCATCTACTATAAAAGTAGGTTCTGAGCTTAATGCAGCAG 

ATGCAAGTAAATTATTGTTTACACAAGTAGATAATAAACTAGATAATTTAGGTGATGGAGATT 

ATGTAGATTTCTTAATAACTTCTCCAGGTCAAGGGGATAAAATAACTACAAGTAAACTTGTTG 

CATTGAAAGATTTAACAGGTGCTTCAGCAGATGCTATAATTGCTGGAACATCTTCAGCAGATG 

GTGTTGTTACAAATACTGGAGCTGCTAGTGGTTCTACTGAGACAAATTCAGCAGGAACAAAAC 

TTGCAATGTCAGCTATTTTTGACACAGCATATACAGATTCATCTGAAACTGCGGTTAAGATTA 

CTATAAAAGCAGATATGAATGATACTAAATTTGGTAAAGCAGGTGAGACAACTTATTCAACTG 

GGCTTACATTTGAAGATGGGTCTACAGAAAAAATTGTTAAATTAGGGGACAGTGATATTATAG 

ATATAACTAAAGCTCTTAAACTTACTGTTGTTCCTGGAAGTAAAGCAACTGTTAAGTTTGCTG 

AAAAAACACCAAGTGCCAGTGTTCAACCAGTAATAACAAAGCTTAGAATAATAAATGCTAAA 

GAAGAAACAATAGATATTGACGCTAGTTCTAGTAAAACAGCACAAGATTTAGCTAAAAAATA 

TGTATTTAATAAAACTGATTTAAATACTCTTTATAAAGTATTAAATGGAGATGAAGCAGATAC 

TAATGGATTAATAGAAGAAGTTAGTGGAAAATATCAAGTAGTTCTTTATCCAGAAGGAAAAA 

GAGTTACAACTAAGAGTGCTGCAAAGGCTTCAATTGCTGATGAAAATTCACCAGTTAAATTAA 

CTCTTAAGTCAGATAAGAAGAAAGACTTAAAAGATTATGTGGATGATTTAAGAACATATAAT 

AATGGATATTCAAATGCTATAGAAGTAGCAGGAGAAGATAGAATAGAAACTGCAATAGCATT 

AAGTCAAAAATATTATAACTCTGATGATGAAAATGCTATATTTAGAGATTCAGTTGATAATGT 

AGTATTGGTTGGAGGAAATGCAATAGTTGATGGACTTGTAGCTTCTCCTTTAGCTTCTGAAAA 

GAAAGCTCCTTTATTATTAACTTCAAAAGATAAATTAGATTCAAGCGTAAAAGCTGAAATAAA 

GAGAGTTATGAATATAAAGAGTACAACAGGTATAAATACTTCAAAGAAAGTTTATTTAGCTG 

GTGGAGTTAATTCTATATCTAAAGAAGTAGAAAATGAATTAAAAGATATGGGACTTAAAGTT 

ACAAGATTAGCAGGAGATGATAGATATGAAACTTCTCTAAAAATAGCTGATGAAGTAGGTCT 

TGATAATGATAAAGCATTTGTAGTTGGAGGAACAGGATTAGCAGATGCCATGAGTATAGCTCC 

AGTTGCATCTCAATTAAGAAATGCTAATGGTAAAATGGATTTAGCTGATGGTGATGCTACACC 

AATAGTAGTTGTAGATGGAAAAGCTAAAACTATAAATGATGATGTAAAAGATTTCTTAGATG 

ATTCACAAGTTGATATAATAGGTGGAGAAAACAGTGTATCTAAAGATGTTGAAAATGCAATA 

GATGATGCTACAGGTAAATCTCCAGATAGATATAGTGGAGATGATAGACAAGCAACTAATGC 

AAAAGTTATAAAAGAATCTTCTTATTATCAAGATAACTTAAATAATGATAAAAAAGTAGTTAA 

TTTCTTTGTAGCTAAAGATGGTTCTACTAAAGAAGATCAATTAGTTGATGCTTTAGCAGCAGCT 

CCAGTTGCAGCAAACTTTGGTGTAACTCTTAATTCTGATGGTAAGCCAGTAGATAAAGATGGT 

AAAGtATTAACTGGTTCTGATAATGATAAAAATAAATTAGTATCTCCAGCACCTATAGTATTAG 

CTACTGATTCTTTATCTTCAGATCaAAGTGTATCTATAAGTAaAGTTCTTGATAAAGATAATGG 

AGAAAACTTAGTTCAAGTTGGTAAAGGTATAGCTACTTCAGTTATAAACAAAATGAAAGATTT 

ATTAGATATGTAA 
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SEQ ID No. 4 
(Strain 172450 DNA) 

ATGAAAAAAAGAAATTTAGCAATGGCTATGGCAGCTGTTACTGTAGTAGGTTCTGCTGCTCCA 

GTTTTTGCAGCAGCTTCAGATGTAATATCACTACAAGATGGTACAAATGATAAGTATACAGTA 

TCAAATACTAAAGCTAGTGACTTAGTAAAGGATATTTTAGCAGCACAAAACTTAACAACAGGT 

GCAGTTATTTTGAACAAAGATACAAAAGTTACTTTCTATGATGCAAATGAGAAAGATTCTTCA 

ACTCCAACTGGAGATAAAAAAGTTTATTCAGAACAAACTTTAACTACAGCTAATGGAAATGA 

AGATTATGTAAAGACAACTTTAAAAAATTTAGATGCAGGAGAATATGCTATTATAGATTTAAC 

TTATAATAATGCTAAAACTGTTGAAATTAAAGTAGTAGCAGCTAGTGAAAAAACAGTAGTTGT 

ATCTAGTGATGCGAAAAATAGTGCAAAAGATATAGCTGAAAAATATGTGTTTGAAGACAAAG 

ACTTAGAAAATGCACTAAAAACTATAAATGCCTCAGATTTCAGTAAAACTGATAGTTACTATC 

AAGTAGTTCTTTATCCAAAAGGAAAGAGATTACAAGGTTTCTCAACTTATAGAGCTACAAATT 

ATAATGAAGGAACTGCATATGGTAATACACCAGTAATATTAACTCTAAAATCTACTAGTAAGA 

GTAATTTAAAGACTGCAGTAGAAGAGTTACAAAAATTGAATGCTAGTTATTCTAATACTACAA 

CTTTAGCTGGTGATGACAGAATACAAACAGCTATAGAGATAAGTAAAGAATATTACAATAAT 

GATGGCGAGAAATCAGATCATTCAGCTGATGTTAAAGAGAATGTTAAAAATGTTGTATTAGTA 

GGTGCAAATGCACTAGTAGATGGATTAGTTGCGGCTCCTTTAGCAGCAGAAAAAGATGCTCCA 

CTATTATTAACTTCAAAAGATAAATTAGATTCGTCAGTAAAATCTGAAATAAAGAGAGTTTTA 

GACTTAAAAACTTCAACAGAAGTAACAGGAAAAACAGTTTATATAGCTGGTGGAGTTAATAG 

TGTATCTAAAGAAGTTGTAACAGAATTAGAATCAATGGGATTAAAAGTTGAAAGATTCTCAG 

GTGATGATAGATATGAAACTTCTTTAAAAATAGCAGGTGAAATAGGCTTAGATAATGATAAG 

GCTTATGTAGTTGGTGGAACAGGATTAGCAGATGCCATGAGTATAGCTTCAGTTGCTTCTACT 

AAATTAGATGGTAATGGTGTTGTAGATAGAACAAATGGACATGCTACTCCAATAGTTGTTGTA 

GATGGAAAAGCTGATAAAATATCTGATGACTTAGATAGTTTCTTAGGAAGCGCTGATGTAGAT 

ATAATAGGTGGATTTGCAAGTGTATCTGAAAAGATGGAAGAAGCTATATCAGATGCTACTGGT 

AAAGGCGTTACAAGAGTTAAAGGCGACGATAGACAAGACACTAACTCTGAAGTTATAAAAAC 

ATATTATGCTAATGATACTGAAATAGCTAAAGCTGCAGTTTTAGATAAAGATTCAGGTGCTTC 

AAGTAGTGATGCAGGAGTATTTAATTTCTATGTAGCTAAAGATGGATCTACAAAAGAAGATCA 

ATTAGTTGATGCATTAGCAGTAGGAGCTGTTGCTGGATATAAACTTGCTCCAGTTGTATTAGCT 

ACTGATTCTTTATCTTCTGATCAATCGGTTGCTATAAGCAAAGTTGTAGGAGAAAAATATTCTA 

AAGATTTAACACAAGTTGGTCAAGGAATAGCTAATTCAGTTATAAACAAAATGAAAGATTTAT 

TAGATATG 



SEQ ID No. 5 
(Strain 170324 DNA) 

ATGAATAAGAAAAATATAGCAATAGCTATGTCAGGTTTAACAGTTTTAGCTTCGGCTGCTCCT 

GTTTTTGCTGCAACTACTGGAACACAAGGTTATACTGTAGTTAAAAACGACTGGAAAAAAGCA 

GTAAAACAATTACAAGATGGACTAAAAGATAATAGTATAGGAAAGATAACTGTATCTTTTAA 

TGATGGGGTTGTGGGTGAAGTAGCTCCTAAAAGTGCTAATAAGAAAGCGGACAGAGATGCTG 

CAGCTGAGAAGTTATATAATCTTGTTAACACTCAATTAGATAAATTAGGTGATGGAGATTATG 

TTGATTTTTCTGTAGATTATAATTTAGAAAACAAAATAATAACTAATCAAGCAGATGCAGAAG 

CAATTGTTACAAAGTTAAATTCACTTAATGAGAAAACTCTTATTGATATAGCAACTAAAGATA 

CTTTTGGAATGGTTAGTAAAACACAAGATAGTGAAGGTAAAAATGTTGCTGCAACAAAGGCA 

CTTAAAGTTAAAGATGTTGCTACATTTGGTTTGAAGTCTGGTGGAAGCGAAGATACTGGATAT 

GTTGTTGAAATGAAAGCAGGAGCTGTAGAGGATAAGTATGGTAAAGTTGGAGATAGTACGGC 

AGGTATTGCAATAAATCTTCCTAGTACTGGACTTGAATATGCAGGTAAAGGAACAACAATTGA 

TTTTAATAAAACTTTAAAAGTTGATGTAACAGGTGGTTCAACACCTAGTGCTGTAGCTGTAAG 

TGGTTTTGTAACTAAAGATGATACTGATTTAGCAAAATCAGGTACTATAAATGTAAGAGTTAT 

AAATGCAAAAGAAGAATCAATTGATATAGATGCAAGCTCATATACATCAGCTGAAAATTTAG 

CTAAAAGATATGTATTTGATCCAGATGAAATTTCTGAAGCATATAAGGCAATAGTAGCATTAC 

AAAATGATGGTATAGAGTCTAACTTAGTTCAGTTAGTTAATGGAAAATATCAAGTGATTTTTT 

ATCCAGAAGGTAAAAGATTAGAAACTAAATCAGCAAATGATACAATAGCTAGTCAAGATACA 

CCAGCTAAAGTAGTTATAAAAGCTAATAAATTAAAAGATTTAAAAGATTATGTAGATGATTTA 

AAAACATATAATAATACTTATTCAAATGTTGTAACAGTAGCAGGAGAAGATAGAATAGAAAC 
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TGCTATAGAATTAAGTAGTAAATATTATAATTCTGATGATAAAAATGCAATAACTGATAAAGC 

AGTTAATGATATAGTATTAGTTGGATCTACATCTATAGTTGATGGTCTTGTTGCATCACCATTA 

GCTTCAGAAAAAACAGCTCCATTATTATTAACTTCAAAAGATAAATTAGATTCATCAGTAAAA 

TCTGAAATAAAGAGAGTTATGAACTTAAAGAGTGACACTGGTATAAATACTTCTAAAAAAGTT 

TATTTAGCTGGTGGAGTTAATTCTATATCTAAAGATGTAGAAAATGAATTGAAAAACATGGGT 

CTTAAAGTTACTAGATTATCAGGAGAAGACAGATACGAAACTTCTTTAGCAATAGCTGATGAA 

ATAGGTCTTGATAATGATAAAGCATTTGTAGTTGGTGGTACTGGATTAGCAGATGCTATGAGT 

ATAGCTCCAGTTGCTTCTCAACTTAAAGATGGAGATGCTACTCCAATAGTAGTTGTAGATGGA 

AAAGCAAAAGAAATAAGTGATGATGCTAAGAGTTTCTTAGGAACTTCTGATGTTGATATAATA 

GGTGGAAAAAATAGCGTATCTAAAGAGATTGAAGAGTCAATAGATAGTGCAACTGGAAAAAC 

TCCAGATAGAATAAGTGGAGATGATAGACAAGCAACTAATGCTGAAGTTTTAAAAGAAGATG 

ATTATTTCACAGATGGTGAAGTTGTGAATTACTTTGTTGCAAAAGATGGTTCTACTAAAGAAG 

ATCAATTAGTAGATGCCTTAGCAGCAGCACCAATAGCAGGTAGATTTAAGGAGTCTCCAGCTC 

CAATCATACTAGCTACTGATACTTTATCTTCTGACCAAAATGTAGCTGTAAGTAAAGCAGTTC 

CTAAAGATGGTGGAACTAACTTAGTTCAAGTAGGTAAAGGTATAGCTTCTTCAGTTATAAACA 

AAATGAAAGATTTATTAGATATGG 



SEQ ID No. 6 
(Strain 171448 DNA) 

ATGAATAAGAAAAATATAGCAATAGCTATGTCAGGTTTAACAGTTTTAGCTTCGGCTGCTCCT 

GTTTTTGCTGCAACTACTGGAACACAAGGTTATACTGTAGTTAAAAACGACTGGAAAAAAGCA 

GTAAAACAATTACAAGATGGACTAAAAGATAATAGTATAGGAAAGATAACTGTATCTTTTAA 

TGATGGGGTTGTGGGTGAAGTAGCTCCTAAAAGTGCTAATAAGAAAGCGGACAGAGATGCTG 

CAGCTGAGAAGTTATATAATCTTGTTAACACTCAATTAGATAAATTAGGTGATGGAGATTATG 

TTGATTTTTCTGTAGATTATAATTTAGAAAACAAAATAATAACTAATCAAGCAGATGCAGAAG 

CAATTGTTACAAAGTTAAATTCACTTAATGAGAAAACTCTTATTGATATAGCAACTAAAGATA 

CTTTTGGAATGGTTAGTAAAACACAAGATAGTGGAGGTAAAAATGTTGCTGCAACAAAGGCA 

CTTAAAGTTAAAGATGTTGCTACATTTGGTTTGAAGTCTGGTGGAAGCGAAGATACTGGATAT 

GTTGTTGAAATGAAAGCAGGAGCTGTAGAGGATAAGTATGGTAAAGTTGGAGATAGTACGGC 

AGGTATTGCAATAAATCTTCCTAGTACTGGACTTGAATATGCAGGTAAAGGAACAACAATTGA 

TTTTAATAAAACTTTAAAAGTTGATGTAACAGGTGGTTCAACACCTAGTGCTGTAGCTGTAAG 

TGGTTTTGTAACTAAAGATGATACTGATTTAGCAAAATCAGGTACTATAAATGTAAGAGTTAT 

AAATGCAAAAGAAGAATCAATTGATATAGATGCAAGCTCATATACATCAGCTGAAAATTTAG 

CTAAAAGATATGTATTTGATCCAGATGAAATTTCTGAAGCATATAAGGCAATAGTAGCATTAC 

AAAATGATGGTATAGAGTCTAATTTAGTTCAGTTAGTTAATGGAAAATATCAAGTGATTTTTT 

ATCCAGAAGGTAAAAGATTAGAAACTAAATCAGCAAATGATACAATAGCTAGTCAAGATACA 

CCAGCTAAAGTAGTTATAAAAGCTAATAAATTAAAAGATTTAAAAGATTATGTAGATGATTTA 

AAAACATATAATAATACTTATTCAAATGTTGTAACAGTAGCAGGAGAAGATAGAATAGAAAC 

TGCTATAGAATTAAGTAGTAAATATTATAATTCTGATGATAAAAATGCAATAACTGATAAAGC 

AGTTAATGATATAGTATTAGTTGGATCTACATCTATAGTTGATGGTCTTGTTGCATCACCATTA 

GCTTCAGAAAAAACAGCTCCATTATTATTAGCTTCAAAAGATAAATTAGATTCATCAGTAAAA 

TCTGAAATAAAGAGAGTTATGAACTTAAAGAGTGACACTGGTATAAATACTTCTAAAAAAGTT 

TATTTAGCTGGTGGAGTTAATTCTATATCTAAAGATGTAGAAAATGAATTGAAAAACATGGGT 

CTTAAAGTTACTAGATTATCAGGAGAAGACAGATACGAAACTTCTTTAGCAATAGCTGATGAA 

ATAGGTCTTGATAATGATAAAGCATTTGTAGTTGGTGGTACTGGATTAGCAGATGCTATGAGT 

ATAGCTCCAGTTGCTTCTCAACTTAAAGATGGAGATGCTACTCCAATAGTAGTTGTAGATGGA 

AAAGCAAAAGAAATAAGTGATGATGCTAAGAGTTTCTTAGGAACTTCTGATGTTGATATAATA 

GGTGGAAAAAATAGCGTATCTAAAGAGATTGAAGAGTCAATAGATAGTGCAACTGGAAAAAC 

TCCAGATAGAATAAGTGGAGATGATAGACAAGCAACTAATGCTGAAGTTTTAAAAGAAGATG 

ATTATTTCACAGATGGTGAAGTTGTGAATTACTTTGTTGCAAAAGATGGTTCTACTAAAGAAG 

ATCAATTAGTAGATGCCTTAGCAGCAGCACCAATAGCAGGTAGATTTAAGGAGTCTCCAGCTC 

CAATCATACTAGCTACTGATACTTTATCTTCTGACCAAAATGTAGCTGTAAGTAAAGCAGTTC 

CTAAAGATGGTGGAACTAACTTAGTTCAAGTAGGTAAAGGTATAGCTTCTTCAGTTATAAACA 

AAATGAAAGATTTATTAGATATGG 
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SEQ ID No. 7 
ATGAATAAGAA^ 

ATATTTGCAGATAGTACTACGCCAGGTTATACTGTAGTGAAAAATGATTGGAAAAAAGCAGT 

AAAACAATTACAAGATGGGTTGAAAAATAAAACTATATCAACAATAAAGGTGTCTTTTAATG 

GAAACTCTGTTGGAGAAGTTACACCAGCCAGTTCTGGAGCAAAAAAAGCAGATAGAGATGCT 

GCAGCTGAAAAGTTATATAATTTAGTAAATACACAATTAGATAAACTAGGTGATGGAGATTAC 

GTTGACTTTGAAGTAACTTATAATTTAGCTACTCAAATAATTACAAAAGCAGAAGCAGAGGCA 

GTTCTTACAAAATTACAACAATATAATGATAAAGTACTTATAAATTCTGCAACAGATACAGTA 

AAAGGTATGGTATCTGATACACAAGTTGATAGCAAAAATGTTGCAGCTAACCCACTTAAAGTT 

AGTGATATGTATACAATACCATCTGCTATTACTGGAAGTGATGATTCTGGGTATAGTATTGCT 

AAACCAACAGAAAAGACTACAaGTTTATTGTATGGTACGGTTGGTGATGCAACTGCAGGTAAA 

GCAATAACAGTAGATACAGCTTCAAATGAAGCTTTTGCTGGAAATGGAAAGGTTATTGACTAC 

AATAAATCATTCAAAGCAACTGTACAAGGAGATGGAACAGTTAAGACAAGCGGGGTTGTACT 

TAAAGATGCAAGTGATATGGCTGCAACAGGTACTATAAAAGTTAGAGTTACAAGTGCAAAAG 

AAGAATCTATTGATGTGGATTCAAGTTCATATATTAGTGCTGAAAATTTAGCTAAAAAATATG 

TATTTAATCCTAAAGAGGTTTCTGAAGCTTATAATGCAATAGTTGCATTACAAAATGATGGAA 

TAGAATCTGATTTAGTACAATTAGTTAATGGAAAATATCAAGTTATTTTCTATCCAGAAGGAA 

AAAGATTAGAAACTAAATCTGCAGATATAATAGCTGATGCAGATAGTCCAGCTAAAATAACT 

ATAAAAGCTAATAAATTAAAAGATTTAAAAGATTATGTAGATGATTTAAAAACATACAATAA 

TACTTACTCAAATGTTGTAACAGTAGCAGGAGAAGATAGAATAGAAACTGCTATAGAATTAA 

GTAGTAAATATTATAATTCTGATGATAAAAATGCAATAACTGATGATGCAGTTAATAATATAG 

TATTAGTTGGATCTACATCTATAGTTGATGGTCTTGTTGCATCACCATTAGCTTCAGAAAAAAC 

AnCTCCATTATTATTAACTTCAAAAGATAAATTAGATTCATCAGTAAAATCTGAGATAAAAAG 

AGTTATGAACTTAAAGAGTGATACTGGTATAAATACTTCTAAAAAAGTTTATTTAGCTGGTGG 

AGTTAATTCTATATCTAAAGATGTAGAAGATGAATTGAAAAATATGGGCCTTAAAGTTACTAG 

ATTATCAGGAGAAGACAGATACGAAACTTCTTTAGCAATAGCTGATGAAATAGGTCTTGATAA 

TGATAAAGCATTTGTAGTTGGTGGTACTGGATTGGCAGATGCTATGAGTATAGCTCCAGTTGC 

TTCTCAACTTAAAGATGGAGATGCTACTCCAATAGTAGTTGTAGATGGAAAAGCAAAAGAAA 

TAAGTGATGATGCTAAGAGTTTCTTAGGAACTTCTGATGTTGATATAATAGGTGGAAAAAATA 

GCGTATCTAAAGAGATTGAAGAGTCAATAGATAGTGCAACTGGAAAAACTCCAGATAGAATA 

AGTGGAGATGACAGACAAGCAACTAATGCTGAAGTTTTAAAAGAAGATGATTATTTCAAAGA 

TGGTGAAGTTGTGAATTACTTTGTTGCAAAAGATGGTTCTACTAAAGAAGATCAATTAGTAGA 

TGCATTAGCAGCAGCACCAATAGCAGGTAGATTTAAGGAGTCTCCAGCTCCAATCATACTAGC 

TACTGATACTTTATCTTCTGACCAAAATGTAGCTGTAAGTAAAGCAGTTCCTAAAGATGGTGG 

AACTAACTTAGTTCAAGTAGGTAAAGGTATAGCTTCTTCAGTTATAAACAAAATGAAAGATTT 

ATTAGATATGTAA 
SEQ ID No. 8 

ATGAAT^AGAAGGATATAGCAATAGCTATGTCAGGATTAACAGTATTAGCTTCTGCAGCACCT 

GTATTTGCTGCTAGTAGTTTTACAGCAGATTATAATTATACTGTAGTGCAAGGAAAATATCAA 

AAAGTTATAACTGGATTACAAGATGGTTTAAAAAATGGAAAAATAACAAATATTGATGTAAT 

ATTTGATGGAAGTTCAATTGGTGAGGTAGTGCCAGGTTCTGATGCTGCAGCTGCAGCTACTAA 

ATTAAAAAGTTTAGTTGATGATAAGTTAGATAACTTAGGTGATGGAAAATACGTTCAATTTAA 

TGTTACTTATACTACTAAATCTATAATAACTAAAGCAGAATTAAAAAATTATTATAATCAATT 

AGAAAGTAGTAAAGATAGAATACTTATAGGAAATGAACCTCAAGATACAGGAACTAAAGGTC 

TTATAAAAGCTGATACTGATGGTACTACTGCTGTTGCAGCAGCTGCACCATTGAAATTATCAG 

ATATATTTACGTTTAGTTATGATGAAGTAACAGGTGTACTTAAAGCAGAACCAACAAGTAAAG 

TAAGCGCTGGTAAAGTTCAAGGTCTAAAATATGGAAATACAGGAGCAACTAACTATACTTCTG 

GAGCTGAAATATCTGTTCCTACTACAGGCTTAACATTAACTGCTGATACAACTGCAACAACAG 

ATGTAAATATTTCTGATGTTATGAGTGCATTTAAATTTAATGGTACTGATACGATTAGTGGATT 

CCCAGCTGGTTCATCAGCTTCTACTCTTAGAGCAAGTATAAAAGTAATAAATGCAAAAGAAGA 

ATCTATAGATGTTGATTCAAGTTCACATAGAACAGCTGAAGATTTAGCTGAAAAATATGTATT 
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TAAACCAGAAGATGTGAATAAAACTTATGAGGCACTGACTGATTTATATAAAGAAGGTATAA 

CAAGTAATCTTATCACTCAAGATGGTGGAAAATATCAAGTTGTTTTATTTGCTCAAGGAAAGA 

GATTAACTACTAAAGGAGCAACTGGAACTTTAGCAGATGAAAATTCTCCTCTTAAAGTAACAA 

TAAAAGCAGATAAAGTAAAAGACTTAAAAGATTATGTTGAAGATTTAAAAAATGCTAACAAT 

GGATATTCAAATTCTGTTGTTGTAGCAGGTGAAGATAGAATAGAAACAGCAATAGAGTTAAG 

TAGCAAATACTATAACTCTGATGATGACAATGCAATAACTAAAGATCCAGTTAACAATGTTGT 

TTTAGTTGGTTCTCAAGCTGTAGTTGATGGGCTTGTAGCTTCACCTTTAGCATCTGAAAAAAGA 

GCTCCTTTACTATTAACTTCAGCAGGAAAATTAGATTCAAGTGTTAAAGCTGAGTTGAAAAGA 

GTAATGGATTTAAAATCTACAACAGGTGTAAATACTTCTAAAAAAGTTTACTTAGCTGGTGGA 

GTAAACTCTATATCTAAAGATGTAGAAAATGAATTAAAAGATATGGGACTTAAAGTTACAAG 

ATTATCAGGAGATGATAGATATGAAACTTCTTTAGCTATAGCTGATGAAATAGGTCTTGATAA 

TGATAAAGCTTTTGTAGTTGGAGGAACAGGATTAGCGGATGCTATGAGTATAGCTCCAGTTGC 

TTCTCAATTAAGAAACTCAAATGGAGAACTTGACTTAAAAGGTGATGCAACTCCAATAGTAGT 

TGTTGATGGAAAAGCTAAAGATATAAATTCTGAAGTAAAAGATTTCTTAGATGATTCACAAGT 

TGATATAATAGGTGGTGTAAATAGTGTTTCTAAAGAAGTAATGGAAGCAATAGATGATGCTAC 

TGGAAAATCACCTGAGAGATATAGTGGAGAAGATAGACAAGCAACAAATGCTAAAGTTATAA 

AAGAAGATGATTTCTTTAAAAATGGAGAAGTTACAAACTTCTTTGTAGCTAAAGATGGTTCAA 

CTAAAGAAGATCAATTAGTAGATGCTTTAGCAGGTGCTGCAATTGCTGGTAACTTTGGTGTAA 

CAGTAGATAATGAAGGAAAACCTACAGTTGCTGATAAAAAAGCTTCTCCAGCACCAATTGTTT 

TAGCAACAGATTCTTTATCTTCTGATCAAAATGTAGCTATAAGTAAAGCTGTAAATGATGACG 

CTAATACTAAGAATCTAGTTCAAGTTGGTAAAGGTATAGCTACTTCAGTTGTAAGTAAAATAA 

AAGATTTATTAGATATG 

SEQ ID No. 9 
(Strain 170444 DNA) 

ATGAATAAGAAAAATATAGCAATAGCTATGTCAGGTTTAACAGTTTTAGCTTCGGCTGCTCCT 

GTTTTTGCTGCAACTACTGGAACACAAGGTTATACTGTAGTTAAAAACGACTGGAAAAAAGCA 

GTAAAACAATTACAAGATGGACTAAAAGATAATAGTATAGGAAAGATAACTGTATCTTTTAA 

TGATGGGGTTGTGGGTGAAGTAGCTCCTAAAAGTGCTAATAAGAAAGCGGACAGAGATGCTG 

CAGCTGAGAAGTTATATAATCTTGTTAACACTCAATTAGATAAATTAGGTGATGGAGATTATG 

TTGATTTTTCTGTAGATTATAATTTAGAAAAAAAAATAATAACTAATCAAGCAGATGCAGAAG 

CAATTGTTACAAAGTTAAATTCACTTAATGAGAAAACTCTTATTGATATAGCAACTAAAGATA 

CTTTTGGAATGGTTAGTAAAACACAAGATAGTGAAGGTAAAAATGTTGCTGCAACAAAGGCA 

CTTAAAGTTAAAGATGTTGCTACATTTGGTTTGAAGTCTGGTGGAAGCGAAGATACTGGATAT 

GTTATTGAAATGAAAGCAGGAGCTGTAGAGGATAAGTATGGTAAAGTTGGAGATAGTACGGC 

AGGTATTGCAATAAATCTTCCTAGTACTGGACTTGAATATGCAGGTAAAGGAACAACAATTGA 

TTTTAATAAAACTTTAAAAGTTGATGTAACAGGTGGTTCAACACCTAGTGCTGTAGCTGTAAG 

TGGTTTTGTAACTAAAGATGATACTGATTTAGCAAAATCAGGTACTATAAATGTAAGAGTTAT 

AAATGCAAAAGAAGAATCAATTGATATAGATGCAAGCTCATATACATCAGCTGAAAATTTAG 

CTAAAAGACATGTATTTGATCCAGATGAAATTTCTGAAGCATATAAGGCAATAGTAGCATTAC 

AAAATGATGGTATAGAGTCTAATTTAGTTCAGTTAGTTAATGGAAAATATCAAGTGATTTTTT 

ATCCAGAAGGTAAAAGATTAGAAACTAAATCAGCAAATGATACAATAGCTAGTCAAGATACA 

CCAGCTAAAGTAGTTATAAAAGCTAATAAATTAAAAGATTTAAAAGATTATGTAGATGATTTA 

AAAACATATAATAATACTTATTCAAATGTTGTAACAGTAGCAGGAGAAGATAGAATAGAAAC 

TGCTATAGAATTAAGTAGTAAATATTATAATTCTGATGATAAAAATGCAATAACTGATAAAGC 

AGTTAATGATATAGTATTAGTTGGATCTACATCTATAGTTGATGGTCTTGTTGCATCACCATTA 

GCTTCAGAAAAAACAGCTCCATTATTATTAACTTCAAAAGATAAATTAGATTCATCAGTAAAA 

TCTGAAATAAAGAGAGTTATGAACTTAAAGAGTGACACTGGTATAAATACTTCTAAAAAAGTT 

TATTTAGCTGGTGGAGTTAATTCTATATCTAAAGATGTAGAAAATGAATTGAAAAACATGGGT 

CTTAAAGTTACTAGATTATCAGGAGAAGACAGATACGAAACTTCTTTAGCAATAGCTGATGAA 

ATAGGTCTTGATAATGATAAAGCATTTGTAGTTGGTGGTACTGGATTAGCAGATGCTATGAGT 

ATAGCTCCAGTTGCTTCTCAACTTAAAGATGGAGATGCTACTCCAATAGTAGTTGTAGATGGA 

AAAGCAAAAGAAATAAGTGATGATGCTAAGAGTTTCTTAGGAACTTCTGATGTTGATATAATA 

GGTGGAAAAAATAGCGTATCTAAAGAGATTGAAGAGTCAATAGATAGTGCAACTGGAAAAAC 

TCCAGATAGAATAAGTGGAGATGATAGACAAGCAACTAATGCTGAAGTTTTAAAAGAAGATG 



WO 02/062379 PCT/IE02/00017 



ATTATTTCACAGATGGTGAAGTTGTGAATTACTTTGTTGCAAAAGATGGTTCTACTAAAGAAG 

ATCAATTAGTAGATGCCTTAGCAGCAGCACCAATAGCAGGTAGATTTAAGGAGTCTCCAGCTC 

CAATCATACTAGCTACTGATACTTTATCTTCTGACCAAAATGTAGCTGTAAGTAAAGCAGTTC 

CTAAAGATGGTGGAACTAACTTAGTTCAAGTAGGTAAAGGTATAGCTTCTTCAGTTATAAACA 

AAATGAAAGATTTATTAGATATGA 



SEQ ID No. 10 
(Strain 170426 DNA) 

ATGAATAAGAAAAATATAGCAATAGCTATGTCAGGTTTAACAGTTTTAGCTTCGGCTGCTCCT 

GTTTTTGCTGCAACTACTGGAACACAAGGTTATACTGTAGTTAAAAACGACTGGAAAAAAGCA 

GTAAAACAATTACAGGATGGACTAAAAGATAATAGTATAGGAAAGATAACTGTATCTTTTAA 

TGATGGGGTTGTGGGTGAAGTAGCTCCTAAAAGTGCTAATAAGAAAGCGGACAGAGATGCTG 

CAGCTGAGAAGTTATATAATCTTGTTAACACTCAATTAGATAAATTAGGTGATGGAGATTATG 

TTGATTTTTCTGTAGATTATAATTTAGAAAAAAAAATAATAACTAATCAAGCAGATGCAGAAG 

CAATTGTTACAAAGTTAAATTCACTTAATGAGAAAACTCTTATTGATATAGCAACTAAAGATA 

CTTTTGGAATGGTTAGTAAAACACAAGATAGTGAAGGTAAAAATGTTGCTGCAACAAAGGCA 

CTTAAAGTTAAAGATGTTGCTACATTTGGTTTGAAGTCTGGTGGAAGCGAAGATACTGGATAT 

GTTGTTGAAATGAAAGCAGGAGCTGTAGAGGATAAGTATGGTAAAGTTGGAGATAGTACGGC 

AGGTATTGCAATAAATCTTCCTAGTACTGGACTTGAATATGCAGGTAAAGGAACAACAATTGA 

TTTTAATAAAACTTTAAAAGTTGATGTAACAGGTGGTTCAACACCTAGTGCTGTAGCTGTAAG 

TGGTTTTGTAACTAAAGATGATACTGATTTAGCAAAATCAGGTACTATAAATGTAAGAGTTAT 

AAATGCAAAAGAAGAATCAATTGATATAGATGCAAGCTCATATACATCAGCTGAAAATTTAG 

CTAAAAGATATGTATTTGATCCAGATGAAATTTCTGAAGCATATAAGGCAATAGTAGCATTAC 

AAAATGATGGTATAGAGTCTAATTTAGTTCAGTTAGTTAATGGAAAATATCAAGTGATTTTTT 

ATCCAGAAGGTAAAAGATTAGAAACTAAATCAGCAAATGATACAATAGCTAGTCAAGATACA 

CCAGCTAAAGTAGTTATAAAAGCTAATAAATTAAAAGATTTAAAAGATTATGTAGATGATTTA 

AAAACATATAATAATACTTATTCAAATGTTGTAACAGTAGCAGGAGAAGATAGAATAGAAAC 

TGCTATAGAATTAAGTAGTAAATATTATAATTCTGATGATAAAAATGCAATAACTGATAAAGC 

AGTTAATGATATAGTATTAGTTGGATCTACATCTATAGTTGATGGTCTTGTTGCATCACCATTA 

GCTTCAGAAAAAACAGCTCCATTATTATTAACTTCAAAAGATAAATTAGATTCATCAGTAAAA 

TCTGAAATAAAGAGAGTTATGAACTTAAAGAGTGACACTGGTATAAATACTTCTAAAAAAGTT 

TATTTAGCTGGTGGAGTTAATTCTATATCTAAAGATGTAGAAAATGAATTGAAAAACATGGGT 

CTTAAAGTTACTAGATTATCAGGAGAAGACAGATACGAAACTTCTTTAGCAATAGCTGATGAA 

ATAGGTCTTGATAATGATAAAGCATTTGTAGTTGGTGGTACTGGATTAGCAGATGCTATGAGT 

ATAGCTCCAGTTGCTTCTCAACTTAAAGATGGAGATGCTACTCCAATAGTAGTTGTAGATGGA 

AAAGCAAAAGAAATAAGTGATGATGCTAAGAGTTTCTTAGGAACTTCTGATGTTGATATAATA 

GGTGGAAAAAATAGCGTATCTAAAGAGATTGAAGAGTCAATAGATAGTGCAACTGGAAAAAC 

TCCAGATAGAATAAGTGGAGATGATAGACAAGCAACTAATGCTGAAGTTTTAAAAGAAGATG 

ATTATTTCACAGATGGTGAAGTTGTGAATTACTTTGTTGCAAAAGATGGTTCTACTAAAGAAG 

ATCAATTAGTAGATGCCTTAGCAGCAGCACCAATAGCAGGTAGATTTAAGGAGTCTCCAGCTC 

CAATCATACTAGCTACTGATACTTTATCTTCTGACCAAAATGTAGCTGTAAGTAAAGCAGTTC 

CTAAAGATGGTGGAACTAACTTAGTTCAAGTAGGTAAAGGTATAGCTTCTTCAGTTATAAACA 

AAATGAAAGATTTATTAGATATG 
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Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 

This International Search Report has not been established in respsct of certain claims under Article 17(2)(a) for the following reasons: 

1. [)(] Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 

Although claims 57, 63 and 64 are directed to a method of treatment of the 
human/animal body, the search has been carried out and based on the alleged 
effects of the compound/ compo s i ti on. 

2. X Claims Nos.: 

because they relate to parts of the International Application that do not comply with the prescribed requirements to such 

an extent that no meaningful International Search can be carried out, specifically: 

see FURTHER INFORMATION sheet PCT/ISA/210 



3. f Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 

Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 

This International Searching Authority found multiple inventions in this international application, as follows: 

see additional sheet 



1 . I | As all required additional search fees were timely paid by the applicant, this International Search Report covers all 
I • searchable claims. 

2. | | As all searchable claims could be searched without effort Justifying an additional fee, this Authority did not invite payment 

of any additional fee. 



3. I I As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
I 1 covers only those claims for which fees were paid, specifically claims Nos.: 



4. | y | No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 

(9, 11. 21. 42, 44) - (1-8, 19, 20, 23-41, 52-64) - partial 



Remark on Protest | | The additional search fees were accompanied by the applicant's protest. 

| ^ No protest accompanied the payment of additional search fees. 
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International Application No. PCT/IE 02/G0O17 



FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 



This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: (9, 11, 21, 42, 44) - complete; (1-8, 19, 20, 
23-41, 52-64) - partial 



Clostridium difficile S layer protein (SlpA) comprising SEQ 
ID N0:1 and its corresponding gene (slpA), which comprises 
SEQ ID N0:3; epitopes, hornologs, derivatives, variants or 
fragments thereof. Chimeras comprising any of the previously 
mentioned polynucleotides/(poly)peptides. Antibodies against 
those (poly) peptides. Vaccines comprising any of the former 
and methods for prophylaxis/treatment of C. * 
difficile-associated diseases based on the use thereof. 



2. Claims: (10, 13, 14, 17, 18, 22, 43, 46, 47, 50, 
51) - complete; (1-8, 19, 20, 23-41, 
52-64) - partial 



Idem as subject 1, but restricted to a gene comprising SEQ 
ID MOs: 5, 6, 9 or 10 and a polypeptide/peptide comprising 
SEQ ID N0:2. 



3. Claims: (12, 45) - complete; (1-8, 19, 20, 23-37, 39-41, 
52-54, 56-64) - partial 



Idem as subject 1, but restricted to a gene comprising SEQ 
ID N0:4. 



4. Claims: (15, 48) - complete; (1-8, 19, 20, 23-37, 39-41, 
52-54, 56-64) - partial 



Idem as subject 1, but restricted to a gene comprising SEQ 
ID N0:7. 



5. Claims: (16, 49) - complete; (1-8, 19, 20, 23-37, 39-41, 
52-54, 56-64) - partial 



Idem as subject 1, but restricted to a gene comprising SEQ 
ID N0:8. 



6. Claims: (65) - complete 

Use of interleukin 12 as an adjuvant in a C. difficile 
vaccine. 
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7. Claims: (66) - partial 

Use of humanised antibodies for passive vaccination of an 
individual with C. difficile infection. 

8. Claims: (66) - partial 

Use of serum for passive vaccination of an individual with 
C. difficile infection. 
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FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 



Continuation of Box 1.2 



Please notice that: 

1. The translations of the ORFs contained in appendices 1-8 are not 
included in the sequence listing and, therefore, the one corresponding to 
the first invention has not been searched. In case the applicant decided 
to pay additional fees, he should be aware of the fact that the same will 
apply to the other inventions.. 

2. Claims 39 and 56 refer to SEQ ID NOs:3-10 as "amino acid sequences" 
but, actually, they are nucleotidic sequences. 

3. The sequence numbering is confusing. The sequence identity numbers 
mentioned in the description and claims do not correspond with those of 
the sequence listing (example: SEQ ID N0:1 of the description is a 
peptide which appears under SEQ ID N0:9 of the sequence listing). 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT) . The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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